WORLD INTELLECTUAL PROPERTY ORGANIZATION 
Internationa] Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/12, C07K 14/435 



Al 



(11) International Publication Number: WO 00/04142 

(43) International Publication Date: 27 January 2000 (27.01 .00) 



(21) International Application Number: PCT/AU99/00579 

(22) International Filing Date: 16 July 1999 (16.07.99) 



(30) Priority Data: 

PP 4708 



16 July 1998 (16.07.98) 



AU 



(71) Applicants (for all designated States except US): THE COUN- 

CIL OF THE QUEENSLAND INSTITUTE OF MEDICAL 
RESEARCH [AU/AU]; The Bancroft Centre, 300 Herston 
Road, Herston, QLD 4029 (AU). THE CORPORATION 
OF THE TRUSTEES OF THE ORDER OF THE SISTERS 
OF MERCY IN QUEENSLAND [AU/AU]; Aubigny Place, 
Mater Misericordiae Hospitals, South Brisbane, QLD 4J01 
(AU). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): WILLIAMS, Stephanie, 
Jane [AU/AU]; 48 Stevenson Street, The Grange, QLD 4051 
(AU). ANTALIS, Toni, Marie [AU/AU]; 527 Coronation 
Drive, Toowong, QLD 4066 (AU). McGUCKIN, Michael, 
Andrew [AU/AU]; 27 Leigh Street, Coorparoo, QLD 4151 
(AU). GOTLEY, David, Charles [AU/AU]; 100 Birdwood 
Terrace, Auchenflower, QLD 4066 (AU). 



(74) Agent: FISHER ADAMS KELLY; Level 13, AMP Place, 10 
Eagle Street, Brisbane, QLD 4000 (AU). 



(81) Designated States: AE f AL, AM, AT, AU, AZ, BA, BB, BG, 
BR, BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, 
GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, 
KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, 
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, 
SK, SL, TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZA, 
ZW, ARIPO patent (GH, GM, KE, LS, MW, SD, SL, SZ, 
UG, ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD, 
RU, TJ, TM), European patent (AT, BE, CH, CY, DE, DK, 
ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI 
patent (BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, 
NE, SN, TD, TG). 



Published 

With international search report. 

With an indication in relation to deposited biological 
material furnished under Rule 1 3 bis separately from the 
description. 



(54) Title: MUCINS 



(57) Abstract 

prfiated MUC nucleic acids are provided which correspond to a Mucin gene located on human chromosome 7q22, or on a mammalian 
chromosome structurally or functionally equivalent thereto, which Mucin gene is normally predominantly expressed in the colon. Also 
provided are diagnostic and therapeutic uses of isolated MUC nucleic acids, MUC polypeptides encoded thereby and anti-MUC mAb. 




FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCX on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


b\ 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


Gil 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UC 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JF 


Japan 


NE 


Niger 


VN 


Vict Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


C6te d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






I)E 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark: 


IK 


Sri Lanka 


SE 


Sweden 








WO 00/04142 PCT/AU99/00579 

1 

TITLE 
"MUCINS" 
FIELD OF THE INVENTION 
THIS INVENTION relates generally to nucleic acids 
5 corresponding to mammalian Mucin genes, and to polypeptides encoded 
thereby. More particularly, the present invention provides isolated nucleic 
acids which correspond to Mucin regulatory genes that are predominantly 
expressed in the colon. These Mucin genes are associated with disease 
conditions including colorectal cancer, breast cancer, cystic fibrosis, 
10 respiratory diseases, inflammatory bowel disease, ulcerative colitis and 
Crohn's disease and/or any other conditions associated with aberrant Mucin 
expression, altered properties of mucus or epithelial inflammatory processes 
involving Mucins. In particular, the present invention provides methods for 
the diagnosis and therapy of the abovementioned disease conditions. 
15 BACKGROUND OF THE INVENTION 

The increasing sophistication of recombinant DNA technology 
is greatly facilitating research and development in the medical and allied 
health fields. This is particularly the case in cancer research. However, 
despite the effectiveness of this powerful technology, progress has been 
20 slow in developing effective recombinant DNA-derived therapeutic or 
diagnostic agents for cancers. One difficulty has been a lack of 
understanding of many cancers and other disease conditions. Regulatory 
genes are an important component of these complex regulatory 
mechanisms. 

25 Cancer suppressor genes, for example, are regulators of cell 

growth and differentiation (Weinberg et a/., 1995, Ann. NY Acad. Sci. 758 
331 ). The paradigm for their role in cancer is that they are trans-acting and 
recessive at the cellular level; loss of one homologue has no effect on cell 
function and homozygous inactivation is required for carcinogenesis 

30 (Cavenee et a/., 1983, Nature 305 779). 

Colorectal cancers contribute to a major proportion of the 



WO 00/04142 PCT/AU99/00579 . 

2 

mortality and morbidity associated with cancer development. There is a 
particular need, therefore, to understand the complex regulatory 
mechanisms associated with colorectal cancers as well as cancers in 
anatomically adjacent regions. 
5 The epithelial mucins are a family of secreted and cell surface 

glycoproteins expressed by epithelial tissues. They are characterised by a 
central polymorphic tandem repeat structure, which comprises most of the 
protein backbone, and a large number of O-linked carbohydrate side chains 
(Gum etal., 1995, Biochem. Soc. Trans. 23 795). The complex structure and 
10 large size of these molecules makes it difficult to characterise them using 
classical biochemical techniques. The genes are also difficult to clone 
because of their large size and the presence of GC-rich tandem repeats. 
Ten mucin genes have been identified; MUC3, MUC4, MUC5AC, MUC5B, 
MUC6 and MUC8 have been partially cloned and full-length cDNA clones 
1 5 are available for MUC1, MUC2, MUC7 and MUC9. 

Mucins are known to contribute to pathology in a number of 
epithelial diseases including cystic fibrosis (CF), inflammatory bowel disease 
(IBD) and adenocarcinomas. Gastrointestinal mucins which have been 
described to date include: the transmembrane mucins MUC1 and MUC4; the 
20 gel-forming mucins MUC2, MUC5AC and MUC6; and MUC3 which has an 
unclear structure and function. 

As used herein, Mucin genes or isolated nucleic acids 
corresponding thereto will be expressed in italicized form as MUC. Mucin 
polypeptides will be expressed as MUC. 
25 Immunohistochemical staining and Western blotting analysis 

with mature MUC1-specific antibodies revealed that MUC1 became 
ectopically expressed in colorectal tumours and levels were significantly 
higher in primary tumours of patients with metastases. Experimentally 
increased expression of gel-forming mucins resulted in increased metastasis 
30 in colon cancer cells in xenograft metastasis models (Ho ef a/., 1995, Int. J. 
Oncol. 7 913). Northern blot analysis has been employed to investigate 
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expression of MUC1, MUC2, MUC3 and MUC4 in paired normal and colonic 
tumour tissues and in nine colorectal cancer (CRC) cell lines (Ogata ef a/., 
1992, Cancer Res. 52 5971). MUC1 and MUC4 were present in colonic 
mucosa with similar expression levels in carcinomas, but occasionally 
5 elevated levels of MUC4 were apparent. Levels of MUC2 and MUC3 were 
decreased by varying degrees in the tumours of most patients. There was 
no apparent correlation between the expression of any mucin gene and the 
site, stage or histological type of tumour. All four mucin genes were 
expressed at low levels or not at all in the nine CRC cell lines under 
10 investigation; MUC1 transcripts were detected in COLO205, MUC2 and 
MUC4 probes hybridised weakly to all nine cell lines, and MUC3 expression 
was observed in five of the lines. Using a combination of in situ hybridisation 
and immunohistochemistry, Chang et al (Chang et al., 1994, 
Gastroenterology 107 28) also found MUC2 and MUC3 were downregulated 
1 5 in CRC. A more recent in situ hybridisation study found expression of MUC2 
and MUC3 mRNA was markedly reduced in poorly, moderately and well- 
differentiated colorectal tumours but preserved in mucinous carcinomas 
(Weiss et al., 1996, J. Histochem. Cytochem. 44 1161). It is noted that 
MUC3 is located on human chromosome 7q22, or an equivalent location on 
20 other mammalian chromosomes, and is primarily expressed under normal 
conditions in the small intestine (Shekels ef a/., 1998, Biochem J. 330 1301). 

OBJECT OF THE INVENTION 
The present inventors have realized that the Mucins constitute 
an incomplete family of genes and gene products implicated in a variety of 
25 disease conditions. Surprisingly, the present inventors have identified novel 
Mucin genes located on human chromosome 7q22, and isolated novel 
nucleic acids corresponding thereto. Furthermore, the present inventors 
have found that these novel Mucin genes are predominantly expressed in 
the colon, and may be involved in cancer of the large bowel, cystic fibrosis, 
30 breast cancer, inflammatory bowel disease, ulcerative colitis respiratory 
diseases and Crohn's disease and/or any other conditions associated with 
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aberrant Mucin expression, altered properties of mucus or epithelial 
inflammatory processes involving Mucins. 

It is therefore an object of the invention to provide novel Mucin 
genes and isolated nucleic acids corresponding thereto. 
5 SUMMARY OF THE INVENTION 

The present invention is broadly directed to an isolated MUC 
nucleic acid which corresponds to a MUC gene located on mammalian 
chromosome 7q22, or on a mammalian chromosome structurally or 
functionally equivalent thereto, which MUC gene is normally predominantly 

10 expressed in the colon. 

In a first aspect, the MUC gene of the present invention is 
MUC11. Accordingly, "a MUC11 nucleic acid" means an isolated nucleic 
acid of the invention which corresponds to the MUC11 gene. 

Preferably, the isolated MUC11 nucleic acid comprises a 
15 nucleotide sequence encoding an amino acid sequence which comprises 
SGLSEESTTSHSSPGSTHTTLSPASTTT (SEQ ID NO: 1). 

More preferably, the isolated MUC11 nucleic acid comprises 
a nucleotide sequence encoding the amino acid sequence according to SEQ 
ID NO:3. 

2 0 Even more preferably, the isolated MUC11 nucleic acid 

comprises a nucleotide sequence according to SEQ ID NO: 2. 

In a second aspect, the MUC gene of the present invention is 
MUC12. Accordingly, "a MUC12 nucleic acid" means an isolated nucleic acid 
of the invention which corresponds to the MUC12 gene. 
25 Preferably, the isolated MUC12 nucleic acid comprises a 

nucleotide sequence encoding an amino acid sequence which comprises 
SGLSQESTTFHSSPGSTETTLAPASTTT (SEQ ID NO: 4). 

More preferably, the isolated MUC12 nucleic acid comprises 
a nucleotide sequence encoding the amino acid sequence according to SEQ 
30 ID NO:6. 

Even more preferably, the isolated MUC12 nucleic acid 
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comprises a nucleotide sequence according to SEQ ID NO: 5. 

In a third aspect, the present invention resides in an isolated 
MUC polypeptide. 

In one embodiment, the isolated MUC polypeptide has an 
5 amino acid sequence according to SEQ ID NO: 3, hereinafter referred to as 
a"MUC11 polypeptide". 

In another embodiment, the isolated MUC polypeptide has an 
amino acid sequence according to SEQ ID NO:6, hereinafter referred to as 
a"MUC12 polypeptide". 
10 In a fourth aspect, the present invention resides in an antibody 

specific for a MUC polypeptide (hereinafter referred to as an anti-MUC 
antibody). 

Preferably, the anti-MUC antibody is selected from the group 
consisting of:- 

15 (j) an anti-MUC11 IgM monoclonal antibody hereinafter 

referred to as M1 1 .9; and 
(ii) an anti-MUC12 IgM monoclonal antibody hereinafter 

referred to as M 12. 15. 
In a fifth aspect, the present invention resides in methods of 
20 detecting a MUC gene, a MUC gene transcript or a MUC polypeptide. The 
fifth aspect extends to methods for detecting a polymorphism, deletion, 
mutation, truncation or expansion in a MUC gene, a MUC gene transcript or 
a MUC polypeptide, or detecting a level of expression thereof. One 
embodiment of the fifth aspect is directed to use of an isolated MUC nucleic 
25 acid to determine whether a mammal has a disease condition, or a 
predisposition thereto. Another embodiment is directed to use of an isolated 
MUC polypeptide to determine whether a mammal has a disease condition, 
or a predisposition thereto. 

In a sixth aspect, the present invention provides a method of 
30 gene therapy of a disease condition in a mammal, said method including 
administering to said mammal a gene therapy construct which includes an 
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isolated MUC nucleic acid as hereinbefore defined, to thereby alleviate one 
or more symptoms of said disease condition in said mammal. 

In a seventh aspect, the present invention provides a method 
of treating a disease condition in a mammal, said method comprising the 
5 step of administering to said mammal a pharmaceutically effective amount 
of a MUC polypeptide or an anti-MUC antibody. 

In an eigth aspect, the present invention resides in a 
pharmaceutical composition comprising a MUC polypeptide or anti-MUC 
antibody, together with a pharmaceutically acceptable carrier and/or diluent. 
10 Preferably, the mammal is a human. 

As used herein, the "disease condition" is associated with 
aberrant Mucin expression, altered properties of mucus or epithelial 
inflammatory processes involving Mucins. 

Preferably, the disease condition is selected from the group 
15 consisting of colorectal cancer (CRC), cystic fibrosis (CF), inflammatory 
bowel disease (IBD), breast cancer (BC), Crohn's disease, ulcerative colitis, 
asthma and chronic bronchitis. 

More preferably, the disease condition is selected from the 
group consisting of colorectal cancer (CRC), cystic fibrosis (CF), 
20 inflammatory bowel disease (IBD) and breast cancer (BC). 

As used herein, unless the context requires otherwise, the 
word "comprise", or variations such as "comprises" or "comprising", will be 
understood to imply the inclusion of a stated element or integer or group of 
elements or integers but not the exclusion of any other element or integer or 
25 group of elements or integers. 

RRIFF DESCRIPTION OF THE D RAWINGS 
FIG. 1(A): Autoradiograph of a differential display gel showing amplified 
products from RNA isolated from matched normal colon (N) 
and primary colorectal tumor (P) tissues. Differentially 
30 expressed bands dd29 (MUC12) and dd34 (MUCH) are 

arrowed. 



WO 00/04142 PCT/AU99/00579 

7 

FIG. 1(B): Northern blot analysis of total RNAfrom patient 101 hybridized 
with the dd29 probe to detect a MUC12 gene transcript 
(mRNA). Signal corresponding to 18S ribosomal RNA is shown 
as a loading control. 

5 FIG. 1(C): Northern blot analysis of RNA from patient 1 1 2 hybridized with 
the dd34 probe to detect a MUC11 gene transcript (mRNA) . 
Signal corresponding to 18S ribosomal RNA is shown as a 
loading control. 

FIG. 1(D): Multiplex semi-quantitative RT-PCR showing amplification of 
10 MUC12 mRNA transcripts from matched normal colonic 

mucosa and primary tumor # 40, normal mucosa from patient 
# 81 and six colorectal cancer cell lines. Amplification of p 2 - 
microglobulin (P 2 -MG) is included as a measure of total RNA. 
FIG. 1(E): Multiplex semi-quantitative RT-PCR showing amplification of 
1 5 MUC1 1 mRNA transcripts in matched normal colonic mucosa 

and primary tumors of patients # 40, 164, and 97 and six 
colorectal cancer cell lines. Amplification of p 2 -microglobulin 
(p 2 -MG) is included as a measure of total RNA. 
FIG. 1(F): Multiplex semi-quantitative RT-PCR showing amplification of 
2 0 MUC12 mRNA transcripts from matched normal colonic 

mucosa and primary tumors # 346, 84, 128, 97 and 316 and 
from five unpaired Dukes' stage D tumors (M) # 93, 361 , 107, 
357 and 367. Amplification of p 2 -microglobulin (p -MG) is 
included as a measure of total RNA. 

Multiplex semi-quantitative RT-PCR showing MUC11 mRNA 
transcripts in matched normal colonic mucosa and primary 
tumors of patients #110, 346, 84, 128, and 348 and from five 
unpaired Dukes' stage D tumors (M) # 93, 107, 361, 367 and 
357. Amplification of p 2 -microglobulin (P 2 -MG) is included as 
a measure of total RNA. Ma denotes molecular size markers 
in FIG 1D-G. 



25 FIG. 1(G): 



30 
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FIG. 2: Predicted amino acid sequence of MUC12. Numbering of 
amino acids is given on the right. The consensus sequence of 
the degenerate tandem repeat structure is shown at the top. 
The two cysteine-rich EGF-like domains are double 
5 underlined, a potential coiled-coil domain is in bold, the 

hydrophobic domain singly underlined and potential N- 
glycosylation sites shaded. The stop codon is denoted by an 
asterisk. 

FIG. 3: Amino acid sequence alignment of the carboxyl termini of 
10 MUC12, hMUC3 (amino acids 1-366), mMuc3 (Shekels etal., 

1998, supra; amino acids 637-1015), rMuc3 (Gum et a/., 1991 , 
supra; Khatri ef a/., 1997, Biochem. Biophys. Acta 1326 7; 
amino acids 356-447 and 1-379 respectively), hMUC4 
(Moniauxef a/., 1998, Biochem. J. 338 1998; amino acids 861- 
15 1156) and rMuc4 (Sheng et a/., 1992, J. Biol. Chem. 267 

16341; amino acids 451-744). Light shading demonstrates 
identity with MUC12 and dark shading highlights all cysteine 
residues. Hyphens indicate gaps inserted to optimize the 
alignment. 

20 FIG. 4: Predicted amino acid sequence of MUCH showing the 

degenerate tandem repeat structure. The consensus 
sequence is shown at the top and amino acids not consistent 
with this sequence are shown in bold. Hyphens indicate gaps 
placed in order to optimize the amino acid alignment. A 

25 potential N-glycosylation site is shaded. 

FIG. 5: mRNA tissue distribution of the 7q22 mucin gene family. Only 
those tissues showing a positive signal by Northern blot 
analysis are represented in the histogram. Sixteen tissues of 
neural origin, heart, aorta, skeletal muscle, bladder, stomach, 

30 testis, ovary, spleen, pituitary gland, adrenal gland, thyroid 

gland, salivary gland and mammary gland were negative for 
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mucin mRNA expression. Expression was quantified by 
densitometry and is shown as a proportion of the tissue showing 
highest expression. 

Domain organization of the C-termini of human MUC12, 
hMUC3, the rodent Muc3 mucins and the rat and human MUC4 
mucins. The relative size of domains is accurate except that the 
N-glycosylated domain adjacent to the mucin domain in MUC4 
is shown at approximately one fifth of its actual size. Only the 
beginning of the large mucin domains are shown. 
Alignment of the first extracellular EGF-like domain of MUC12 
with human EGF-like growth factors. Dark shading highlights 
identical amino acids and light shading indicates conservative 
amino acid substitutions. 

Schematic representation of MUC 11 cloning (A) and MUC 12 
cloning (B). 

Normal colonic expression patterns of MUC11 (A, B) and 
MUC12 (C) polypeptides as determined by anti-MUC mAb 
M11.9 and M12.15 immunostaining, respectively. (D) shows 
MUC 1 1 gene transcript (mRNA) expression detected by in situ 
hybridization in normal colonic epithelium and loss of 
expression in CRC (top right). 

Expression of MUC11 and MUC12 mRNA in normal colon as 
detected by RT-PCR. Cytokeratin 20, (CK20) a colonic 
epithelial marker, was employed as a loading control. 'RC 
denotes right colon, TC the transverse colon, 'LC the left 
colon, "SC sigmoid colon; 'CA' refers to the caecum and 'R' 
denotes the rectum. 

Expression of MUC11 and MUC12 mRNA in CRC cell lines as 
detected by RT-PCR. The loading control is p 2 -microglobulin 
(B2MG) and M denotes the molecular weight marker. 
Expression of MUC11 and MUC12 mRNA in IBD as detected 
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by RT-PCR. Cytokeratin 20 (CK20) a colonic epithelial marker, 
was employed as a loading control. 'NT denotes tissues which 
appear macroscopically normal and 'D' refers to tissues 
reported to have IBD. 'CA' refers to the caecum, 'CO' the colon, 
'LC the left colon, TC the transverse colon, 'RS' the recto- 
sigmoid colon, 'SI' the small intestine, 'IL'denotes the ileum and 
'IP' an ileal pouch. 

FIG. 13: Expression of MUC1 1 and MUC12 mRNA in BC as detected by 
RT-PCR. The loading control is p 2 microglobulin denoted by 
B2MG and the molecular weight marker is denoted by 'M'. The 
positive control was normal colonic cDNA from patient 164. 
FIG 14: Northern blot analysis of MUC11 expression in normal colon 
(N) and primary CRC (P) of six patients, assessed using a 
probe corresponding to dd34. The position of ribosomal RNAs 
are indicated, and signal from 18S ribosomal RNA was used as 
a loading control. 
DFTAII FD DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is predicated in part on the identification 
of novel MUC11 and MUC 72 genes which are normally predominantly 
expressed in the colon. The isolated MUC nucleic acids and MUC genes of 
the invention may be useful in treatment and diagnosis of disease conditions 
associated with aberrant Mucin expression, altered properties of mucus or 
epithelial inflammatory processes involving Mucins. Such disease conditions 
include but are not limited to cancer of the large bowel (CRC), cystic fibrosis 
(CF), inflammatory bowel disease (IBD), respiratory diseases such as asthma 
and chronic bronchitis, breast cancer (BC), ulcerative colitis and Crohn's 
disease. 

The present invention is particularly directed to cancers of the 
large bowel, which includes the colon, rectum and anal canal, such as CRC, 
although it extends to biochemically, physiologically and/or genetically 
related cancers in other parts of the gastrointestinal tract. 
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The MUC genes are, for example, down-regulated in CRC. 
By "predominantly expressed" is meant that a MUC gene 
transcript or MUC polypeptide encoded by said MUC gene is expressed in 
the colon at a level greater than in any other organ. 
5 By "associated with" is meant that the disease condition 

displays symptoms consistent with aberrant Mucin expression, altered 
properties of mucus or epithelial inflammation involving Mucins. The disease 
association may be merely correlative or may reflect a causative role of 
Mucins in the disease condition. 
10 The term "cancer" is used in its broadest sense to include 

malignant tumours, carcinomas and sarcomas. 

In light of the foregoing, it will be appreciated that a MUC 
nucleic acid "corresponds to" a MUC gene by being an isolated nucleic acid 
derived from said MUC gene, or a portion thereof. Thus it will be understood 
1 5 that said gene has components including amino acid coding sequences and 
non-coding sequences. Non-coding sequences include, for example, introns 
and regulatory sequences which include a promoter, translation initiation and 
termination sequences and a polyadenylation sequence, for example. The 
isolated MUC nucleic acid may therefore correspond to some or all of the 
20 aforementioned components of the corresponding MUC gene. 

It should be noted that MUC terminology has recently 
undergone revision. In particular, MUC12 was formerly known as dd 29 or 
MUC10. Also, MUC1 1 was formerly known as dd 34. Therefore, with this in 
mind, should the term 'MUCIO'or "dd29" be encountered herein, it should 
25 in all cases be taken to mean MUC12. 

It will also be understood that a MUC polypeptide is encoded 
by an isolated MUC nucleic acid or by a MUC gene as hereinbefore defined. 

Isolated MUC nucleic acids of the invention may be in DNA 
(e.g. cDNA or genomic DNA), RNA (e.g. mRNA) or hybrid DNARNA form, 
30 eithre in double-stranded or single-stranded form. For example, single- 
stranded MUC nucleic acids include nucleic acids having sequences 
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complementary to the nucleotide sequences of SEQ ID NO:2 and SEQ ID 
NO:5. 

In one embodiment, the isolated MUC nucleic acid of the 
invention comprises a nucleotide sequence having at least 60% identity to 
5 the nucleotide sequence according to SEQ ID NO:2, or a nucleotide 
sequence capable of hybridizing thereto under at least low stringency 
conditions. 

In another embodiment, the isolated MUC nucleic acid of the 
invention comprises a nucleotide sequence having at least 60% identity to 
10 the nucleotide sequence according to SEQ ID NO:5, or a nucleotide 
sequence capable of hybridizing thereto under at least low stringency 
conditions. 

According to these embodiments, it is preferable that the 
nucleotide sequence has at least 75% identity. 
15 More preferably, the nucleotide sequence has at least 90% 

sequence identity. 

The term "identity" is used herein in its broadest sense to 
include the number of exact nucleotide or amino acid matches having regard 
to an appropriate alignment using a standard algorithm, such as but not 

20 limited to the Geneworks program (Intelligences). For this purpose, BLAST 
family programs may also be useful (Altschul et a/., 1997, Nucl. Acids Res. 
25 3389, which is herein incorporated by reference). A detailed discussion 
of sequence analysis can be found in Unit 1 9.3 of CURRENT PROTOCOLS 
IN MOLECULAR BIOLOGY, Eds Ausubel et a/., (John Wiley & Sons), which 

25 is herein incorporated by reference. 

According to these embodiments, it is preferable that the 
nucleotide sequence is capable of hybridizing under medium stringency 
conditions. 

More preferably, the nucleotide sequence is capable of 
30 hybridizing under high stringency conditions 

Reference herein to low stringency conditions includes and 
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encompasses from at least about 1 % v/v to at least about 1 5% v/v formamide 
and from at least about 1 M to at least about 2 M salt for hybridisation at 
42°C, and at least about 1 M to at least about 2 M salt for washing at 42°C. 

Low stringency conditions also include 1% Bovine Serum 
5 Albumin (BSA), 1 mM EDTA, 0.5 M NaHP0 4 (pH 7.2), 7% SDS for 
hybridization at 65°C, and (i) 2xSSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM 
EDTA, 40 mM NaHP0 4 (pH 7.2), 5% SDS for washing at room temperature. 

Medium stringency conditions include and encompass from at 
least about 16% v/v to at least about 30% v/v formamide and from at least 
1 0 about 0.5 M to at least about 0.9 M salt for hybridisation at 42°C, and at least 
about 0.5 M to at least about 0.9 M salt for washing at 42°C. 

Medium stringency conditions also include 1 % Bovine Serum 
Albumin (BSA), 1 mM EDTA, 0.5 M NaHP0 4 (pH 7.2), 7% SDS for 
hybridization at 65°C, and (i) 2 x SSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM 
1 5 EDTA, 40 mM NaHP0 4 (pH 7.2), 5% SDS for washing at 42°C. 

High stringency includes and encompasses from at least about 
31 % v/v to at least about 50% v/v formamide and from at least about 0.01 M 
to at least about 0.15 M salt for hybridisation at 42°C, and at least about 0.01 
M to at least about 0.15 M salt for washing at 42°C. 
20 High stringency also includes 1% BSA, 1 mM EDTA, 0.5 M 

NaHP0 4 (pH 7.2), 7% SDS for hybridization at 65°C, and (i) 0.2 x SSC, 0.1% 
SDS; or (ii) 0.5% BSA, 1mM EDTA, 40 mM NaHP0 4 (pH 7.2), 1% SDS for 
washing at a temperature in excess of 65°C. 

In general, washing is carried out at T m = 69.3 + 0.41 (G + C) 
25 % = -1 2°C. However, the T m of a duplex DNA decreases by 1 °C with every 
increase of 1 % in the number of mismatched based pairs. 

Although the MUC genes and isolated MUC nucleic acids of 
the present invention are exemplified in relation to the human mammalian 
species, the present invention extends to orthologs in non-human mammals 
30 such as in primates, laboratory test animals (e.g. mice, rates, rabbits, guinea 
pigs, hamsters), companion animals (e.g. dogs, cats), livestock animals (e.g. 
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sheep, pigs, horses, donkeys, cows) and captive wild animals (e.g. deer, fox). 

In light of the foregoing, the term "MUC homologs" is used to 
encompass MUC orthologs, isolated nucleic acids which hybridize to MUC 
nucleic acids of the invention and isolated nucleic acids which display at 

5 least 60% sequence identity to isolated MUC nucleic acids. 

It will also be appreciated that MUC homologs encompass 
single or multiple nucleotide substitutions, deletions and/or additions to the 
isolated MUC nucleic acids of the invention, inclusive of mutants, fragments, 
parts, portions and segments of the nucleotide sequences of the invention. 
10 The isolated MUC nucleic acids of the present invention and 

homologs thereof therefore include oligonucleotides, primers (such as for 
PCR), antisense sequences, molecules suitable for use in co-suppression 
and fusion nucleic acid molecules. Ribozymes are also contemplated by the 
present invention. It will be understood that probes, primers and antisense 

1 5 sequences correspond to distinct portions of isolated MUC nucleic acids of 
the invention, in that they contain nucleotide sequences based on said 
distinct portions of an isolated MUC nucleic acid sequence. Such probe and 
primer sequences may be based on a MUC sequence of the invention by 
being identical thereto, or by being degenerate with respect thereto. 

20 As used herein, "oligonucleotides" are nucleic acids which 

comprise a contiguous sequence of no more than seventy (70) nucleotides, 
whereas "polynucleotides" are nucleic acids which comprise a contiguous 
sequence of more than seventy (70) nucleotides. A "probe" may be an 
oligonucleotide or a polynucleotide, either double-stranded or single- 

25 stranded, for use in hybridization techniques such as Northern blotting, 
Southern blotting or in situ hybridization. The skilled person will realize that 
in situ hybridization also includes Fluorescence In Situ Hybridization (FISH), 
which is used for determining chromosomal localization. In situ hybridization 
techniques applicable to the present invention will be described in detail 

30 hereinafter. 

A "primer" is a nucleic acid (usually an oligonucleotide) capable 
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of annealing to a nucleic acid template under appropriate conditions of ionic 
strength and temperature, which annealed primer can be extended in a 
template-dependent fashion by a suitable nucleic acid polymerase (for 
example Taq polymerase or Sequenase™). It will therefore be understood 
5 that primers of the invention may be useful for PCR, sequencing, RACE, 
primer extension and the like. 

In use, isolated MUC nucleic acids, probes and primers may be 
modified such as by end-labeling with 32 P-ATP and T4 polynucleotide kinase 
or by random primed labeling with 32 P-dCTP and DNA polymerase. 
1 0 Biotinylation is also contemplated, as is modification with phosphorothiorates, 
fluorochromes, digoxigenin, enzymes and peptides, for example. 

It is contemplated that diagnostic methods may be employed 
which utilize isolated MUC nucleic acids of the present invention, or portions 
thereof such as probes and PCR primers. Also, diagnostic methods 
1 5 employing MUC polypeptides will be discussed in more detail hereinafter. 

Diagnostic methods may include detection of MUC genes, 
transcripts and/or polypeptides in samples such as fecal specimens and/or 
in colonic biopsies, analysis of serum MUC levels in patients with epithelial 
diseases including cancers, breast tissue biopsy samples or in respiratory 
20 mucus samples from patients suffering from CF, asthma or chronic bronchitis. 

The diagnostic methods of the present invention may therefore 
be applicable to determining whether an individual has a disease condition 
associated with aberrant Mucin expression, altered properties of mucus or 
epithelial inflammatory processes involving Mucins, or a predisposition to 
25 said disease. It will be appreciated that "predisposition" as used herein refers 
to an increased probability that an individual will contract the disease. 
However, it will also be appreciated that the diagnostic methods may also 
indicate whether an individual actually suffers from the disease, assist in 
assessing the severity of disease, a prognosis of the likely course of disease 
30 and appropriate treatments for the disease. Thus, the diagnostic methods of 
the invention may be useful whether or not the individual suffers from one or 
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more symptoms of the disease. 

The present invention therefore contemplates methods of 
detecting MUC genes and MUC gene transcripts (e.g. mRNA), such as 
involving hybridization techniques (for example, by Northern or Southern 
5 blotting or in situ hybridization) or polynucleotide sequence amplification 
techniques (for example RT-PCR). Such methods may detect: - 

(i) a polymorphism, deletion, mutation, expansion, and/or 
truncation in a MUC gene or MUC gene transcript; and 

(ii) a relative level of expression of a MUC gene transcript 
1 o (an mRNA transcript derived from a MUC gene). 

Such methods of detection facilitate determination of whether 
said MUC gene is aberrantly-expressed as an indication of a disease 
condition or a predisposition thereto. Also, MUC gene polymorphisms, 
deletions, mutations, truncations or deletions may be detected which indicate 
1 5 a disease condition or a predisposition thereto. 

It will be appreciated, for example, that measurement of a 
relative level of expression of a MUC gene transcript facilitates diagnostic 
assessment of whether MUC gene expression is downregulated and thereby 
indicative of CRC. 

20 Although PCR is the preferred nucleic acid sequence 

amplification technique, It will be appreciated that there are a variety of 
polynucleotide sequence amplification techniques other than PCR, which 
include rolling circle amplification (RCA) and strand displacement 
amplification (SDA). With regard to RCA, reference is made to W097/19193 

25 which is herein incorporated by reference. With regard to SDA, reference is 
made to U.S. Patent No. 5455166, which is herein incorporated by reference. 

Detailed PCR methods are provided hereinafter, although the 
skilled person is also referred to Chapter 15 of CURRENT PROTOCOLS IN 
MOLECULAR BIOLOGY, Eds Ausubel et a/., (John Wiley & Sons), which is 

30 herein incorporated by reference, for a detailed discussion and examples of 
PCR methods. 
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It will also be understood that PCR includes within its scope 
RT-PCR and multiplex PCR as will be described in detail hereinafter. Such 
methods may be used for qualitative or semi-quantitative analysis. PCR- 
based Restriction Fragment Length Polymorphism (PCR-RFLP) methods are 

5 also contemplated, which methods are useful when a polymorphism, deletion 
mutation, truncation and/or expansion either introduces or removes one or 
more restriction endonuclease sites in a MUC gene. 

The skilled person will appreciate that Northern, Southern and 
in situ hybridization methods involve formation of a hybrid nucleic acid 

1 0 comprising a MUC gene or mRNA transcript and a corresponding isolated 
MUC nucleic acid or portion thereof. 

RNA isolation and Northern hybridization methods are 
described in detail herein, although the skilled person is also referred to 
Chapter 4 of CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Eds 

15 Ausubel et al., (John Wiley & Sons), which is herein incorporated by 
reference. 

Furthermore, Southern hybridization methods are described in 
detail herein, although the skilled person is also referred to sections 2.9A-B 
and 2.10 of CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, Eds 
20 Ausubel et al., (John Wiley & Sons), which is herein incorporated by 
reference. 

Also, determining whether a MUC gene or MUC gene transcript 
includes a polymorphism, mutation, deletion, truncation and/or expansion can 
be performed using methods such as PCR-RFLP analysis, Single Strand 

25 Conformational Polymorhpism (SSCP) analysis and Denaturing Gradient Gel 
Electrophoresis (DGGE). These techniques have become well known in the 
art of mutation detection. A non-limiting example of DGGE is provided in 
Folde & Loskoot, 1994, Hum. Mut. 3 83, which is herein incorporated by 
reference. A non-limiting example of specific allele detection by PCR-RFLP 

30 and SSCP is provided in Lappalainen et al. , 1 995, Genomics 27 274, which 
is herein incorporated by reference. 
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It is proposed that mutations in MUC11 or MUC12 genes are 
associated with bowel cancers (CRC), CF, BC, IBD, chronic bronchitis, 
asthma, ulcerative colitis and/or Crohn's disease. These are examples of 
disease conditions associated with aberrant Mucin expression, altered 
5 properties of mucus or epithelial inflammatory processes involving Mucins. 
The isolated MUC nucleic acids now provide a means for genetic screening 
of the abovementioned disease conditions in human and other mammalian 
species. Genetic screening may be conducted by determining full expression 
or full-length transcript production by Northern blot, cloning and sequencing 
1 0 of the MUC genes or identifying mutations by oligonucleotide hybridisation 
or by direct sequencing of PCR amplification products of the MUC genes. In 
addition, the present invention extends to nucleic acid molecules having 
translation-terminating mutations leading to truncation mutants. The detection 
of truncation mutants may be important for genetic analysis of people with, 
1 5 for example, cancer of the large bowel or with a propensity to develop large 
bowel cancer, determined on, for example, hereditary grounds. 

Truncated MUC polypeptides may also be useful in developing 
therapeutic agents such as antagonists or for developing antibodies. 
Truncational mutants may be readily detected by a direct protein truncation 
20 test. In essence, DNA fragments including PCR amplification products or 
corresponding mRNA molecules are subjected to in vitro translation and 
optionally also transcription and the translation products assayed by, for 
example, SDS-PAGE or by differential antibody binding assays. This assay 
may also be employed to screen for agents capable of inducing truncation 
25 mutations or for agents acting as antagonists for truncation mutant-inducing 
agents. 

Alternatively, MUC polypetides may be assayed by, for 
example, by antibody screening such as in an ELISA. 

Thus, it will be appreciated that the present invention 
30 contemplates isolated MUC polypeptides, and also:- 

(i) polypeptides which comprise an amino acid sequence 
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having at least 60% identity to a MUC polypeptide 
amino acid sequence, preferably at least 75% identity 
thereto, or more preferably at least 90% identity thereto; 
and 

5 (ii) polypeptides encoded by MUC homologs. 

Such polypeptides are hereinafter referred to as "MUC 

homologs". 

The MUC polypeptide homologs of the invention include amino 
acid substitution(s), deletion(s) and/or addition(s) to a MUC polypeptide 
1 0 sequence. Particular examples include antigenic fragments and analogues 
useful in immunoassays and as therapeutic agents as well as other 
fragments carrying B cell and/or T cell linear or conformational epitopes. 
Additions to the amino acid sequence include fusion partners in the form of 
peptides or polypeptides, which create a MUC fusion polypeptide. 
1 5 Fusion polypeptides include the MUC polypeptide(s) together 

with fusion partners such as HIS 6 , glutathione-s-transferase (GST), 
thioredoxin (TR) and maltose binding protein (MBP). Fusion partners greatly 
assist recombinant synthetic polypeptide purification by virtue of each fusion 
partner affording affinity purification by a specific affinity matrix. Preferably, 
20 the fusion polypeptide also includes a protease-specific cleavage site, so that 
the fusion partner may be cleaved and removed following purification to leave 
a substantially unmodified MUC polypeptide. 

The use of fusion partners for purification of recombinant 
expressed polypeptides is well known in the art. Indeed, there are a variety 
25 of commercial sources applicable to fusion partners and purification systems 
such as the QIAexpress™ (HIS) 6 system, the Pharmacia GST purification 
system and the New England Biolabs MBP system. 

Also within the scope of fusion partners are "epitope tags". 
Such tags are well known in the art and include c-myc, influenza 

30 hemagglutinin and FLAG tags. 

Furthermore, Green Fluorescent Protein (GFP) is a well known 
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fusion partner applicable to MUC polypeptides of the invention. A particularly 
useful application of GFP fusion partners is in the visible identification of 
cells or tissues which express a GFP-MUC fusion polypeptide of the 
invention. Identification may be performed by flow cytometry or fluorescence 

5 microscopy, as are well known in the art. 

The MUC polypeptides and MUC homologs of the invention 
may be in recombinant form of may be chemically synthesized, as is well 
known in the art. Chemical synthesis is preferably suited to production of 
MUC peptides. As used herein, "peptides" have no more than fifty (50) 

10 contiguous amino acids. 

Preferably, MUC polypeptides are in recombinant form. 
In order to produce recombinant MUC polypeptides, isolated 
MUC nucleic acids of the present invention may be ligated into an expression 
vector to form an expression construct capable of directing expression of said 

1 5 MUC nucleic acid in a prokaryotic cell (for example, E. coli) or in a eukaryotic 
cell (for example, yeast cells, fungal cells, insect cells, mammalian cells or 
plant cells). 

Suitably, the expression vector comprises one or more 
regulatory elements which direct expression of the nucleic acid ligated in said 
20 expression construct. Such regulatory sequences include promoters, 
enhancers, splice donor/acceptor sites, polyadenylation sequences, 
translation initiation (Kozak sequences) and translation termination signals. 
Suitable promoters may be constitutive (for example, CMV- or SV40-derived 
promoters) or inducible (for example, Zn responsive metallothionein 
25 promoters) or repressive (tef-repressible promoters). 

Exemplary methods useful for recombinant protein expression 
and purification, including fusion polypeptides, can be found in Chapters 16 
of CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (Eds. Ausubel et al.\ 
John Wiley & Sons Inc., 1997 Edition) and Chapters 5 and 6 of CURRENT 
30 PROTOCOLS IN PROTEIN SCIENCE (Eds. Coligan ef a/.; John Wiley & 
Sons Inc., 1997 Edition) which are herein incorporated by reference. 
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"Analogues" of the MUC polypeptides of the invention 
contemplated herein include, but are not limited to, modification to side 
chains, incorporating of unnatural amino acids and/or their derivatives during 
peptide, polypeptide or protein synthesis and the use of crosslinkers and 
5 other methods which impose conformational constraints on the proteinaceous 
molecule or their analogues. Such chemical analogues may be useful in 
providing stable means for diagnostic purposes or for producing agonists or 
antagonists or for producing stable molecules for use in natural product 
screening. 

10 Examples of side chain modifications contemplated by the 

present invention include modifications of amino groups such as by reductive 
alkylation by reaction with an aldehyde followed by reduction with NaBH 4 ; 
amidination with methylacetimidate; acylation with acetic anhydride; 
carbamoylation of amino groups with cyanate; trinitrobenzylation of amino 

1 5 groups with 2, 4, 6-trinitrobenzene sulphonic acid (TNBS); acylation of amino 
groups with succinic anhydride and tetrahydrophthalic anhydride; and 
pyridoxylation of lysine with pyridoxal-5-phosphate followed by reduction with 
NaBH 4 . 

The guanidine group of arginine residues may be modified by 
20 the formation of heterocyclic condensation products with reagents such as 
2,3-butanedione, phenylglyoxal and glyoxal. 

The carboxyl group may be modified by carbodiimide activation 
via O-acylisourea formation followed by subsequent derivitisation, for 
example, to a corresponding amide. 
25 Sulphydryl groups may be modified by methods such as 

carboxymethylation with iodoacetic acid or iodoacetamide; performic acid 
oxidation to cysteic acid; formation of a mixed disulphides with other thiol 
compounds; reaction with maleimide, maleic anhydride or other substituted 
maleimide; formation of mercurial derivatives using 4-chloromercuribenzoate, 
30 4-chloromercuriphenylsulphonic acid, phenylmercury chloride, 2- 
chloromercuri-4-nitrophenol and other mercurials; carbamoylation with 
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cyanate at alkaline pH. 

Tryptophan residues may be modified by, for example, 
oxidation with N-bromosuccinimide or alkylation of the indole ring with 2- 
hydroxy-5-nitrobenzyl bromide or sulphenyl halides. Tyrosine residues on 
the other hand, may be altered by nitration with tetranitromethane to form a 
3-nitrotyrosine derivative. 

Modification of the imidazole ring of a histidine residue may be 
accomplished by alkylation with iodoacetic acid derivatives or N- 
carbethoxylation with diethylpyrocarbonate. 

Examples of incorporating unnatural amino acids and 
derivatives during peptide synthesis include, but are not limited to, use of 
norleucine, 4-amino butyric acid, 4-amino-3-hydroxy-5-phenylpentanoic acid, 
6-aminohexanoic acid, t-butylglycine, norvaline, phenylglycine, ornithine, 
sarcosine, 4-amino-3-hydroxy-6-methylheptanoic acid, 2-thienyl alanine 
and/or D-isomers of amino acids. A list of unnatural amino acids, 
contemplated herein is shown in Table 1. Crosslinkers can be used, for 
example, to stabilise tertiary conformation, using homo-bifunctional 
crosslinkers such as the bifunctional imido esters having (CH 2 ) n spacer 
groups with n=1 to n=6, glutaraldehyde, N-hydroxysuccinimide esters and 
hetero-bifunctional reagents which usually contain an amino-reactive moiety 
such as N-hydroxysuccinimide and another group specific-reactive moiety 
such as maleimido or dithio moiety (SH) or carbodiimide (COOH). In 
addition, peptides can be conformational ly constrained by, for example, 
incorporation of C a and N a -methylamino acids, introduction of double bonds 
between C a and C„ atoms of amino acids and the formation of cyclic peptides 
or analogues by introducing covalent bonds such as forming an amide bond 
between the N and C termini, between two side chains or between a side 

chain and the N or C terminus. 

The present invention further contemplates chemical analogues 
of the polypeptides of the invention capable of acting as antagonists or 
agonists thereof, or which can act as functional analogues thereof. Chemical 
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analogues may not necessarily be derived from the polypeptides of the 
invention, but may share certain conformational similarities. Alternatively, 
chemical analogues may be specifically designed to mimic certain 
physiochemical properties of MUC poypeptides. Chemical analogues may 
be chemically synthesised or may be detected following, for example, natural 
product screening. Useful sources for screening for natural products include 
coral, reefs, sea beds, river beds, plants, microorganisms and aqua and 

antarctic environments. 

Still another aspect of the present invention is directed to 
antibodies specific for MUC polypeptides and/or homologs thereof. 

In one embodiment, the anti-MUC antibody is M11.9. 

In another embodiment, the anti-MUC antibody is M12.15. 

A detailed method of anti-MUC antibody preparation is provided 

hereinafter. 

In this regard, it will be understood that anti-MUC polypeptide 
antibodies may be produced by immunization with MUC polypeptides or MUC 
peptides. 

In particular, it is also likely that naturally-occurring anti-MUC 
antibodies may well have naturally arisen against MUC polypeptides. 

In light of the foregoing, it will be appreciated that "anti-MUC 
antibody" as used herein is an antibody specific for, or at least binds to, a 
MUC polypeptide, irrespective of how the anti-MUC antibody was produced. 

The anti-MUC antibodies of the present invention may be 
useful as therapeutic or diagnostic agents. 

For example, a MUC polypeptide or homolog can be used to 
screen for naturally occurring anti-MUC antibodies. These may occur, for 
example in some autoimmune diseases. Alternatively, anti-MUC antibodies 
can be used to screen for MUC polypeptides. Techniques for such assays 
are well known in the art and include, for example, sandwich assays and 
ELISA. Knowledge of endogenous MUC polypeptide levels may be important 
for diagnosis of large bowel cancer or a predisposition to large bowel cancers 
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or for monitoring certain therapeutic protocols. This knowledge may also be 
important in other epithelial cancers such as cancer of the breast. 

Anti-MUC antibodies of the present invention may be 
monoclonal or polyclonal. Alternatively, fragments of antibodies may be 
used such as Fab fragments. Furthermore, the present invention extends to 
recombinant and synthetic antibodies and to antibody hybrids. A "synthetic 
antibody" is considered herein to include fragments and hybrids of 
antibodies. The antibodies of this aspect of the present invention are 
particularly useful for immunotherapy and may also be used as a diagnostic 
tool for assessing cancer development or cancer cell apoptosis or monitoring 
the program of a therapeutic regimum. 

For example, anti-MUC antibodies can be used to screen for 
endogenous MUC polypeptides. The latter would be important, for example, 
as a means for screening for levels of the MUC polypeptide in a cell extract 
or other biological fluid or purifying the MUC polypeptide made by 
recombinant means from culture supernatant fluid. Techniques for the 
assays contemplated herein are known in the art and include, for example, 

sandwich assays and ELISA. 

It is within the scope of this invention to include any second 
antibodies (monoclonal, polyclonal or fragments of antibodies or synthetic 
antibodies) directed to the first mentioned antibodies discussed above. Both 
the first and second antibodies may be used in detection assays or a first 
antibody may be used with a commercially available anti-immunoglobulin 
antibody. An antibody as contemplated herein includes any antibody specific 
to any region of the MUC polypeptide. 

Both polyclonal and monoclonal antibodies are obtainable by 
immunization with the enzyme or protein and either type is utilizable for 
immunoassays. The methods of obtaining both types of sera are well known 
in the art. Polyclonal sera are less preferred but are relatively easily 
prepared by injection of a suitable laboratory animal with an effective amount 
of a MUC polypeptide, or antigenic parts thereof, collecting serum from the 
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animal, and isolating specific sera by any of the known immunoadsorbent 
techniques. Although antibodies produced by this method are utilizable in 
virtually any type of immunoassay, they are generally less favoured because 
of the potential heterogeneity of the product. 

The use of monoclonal antibodies in an immunoassay is 
particularly preferred because of the ability to produce them in large 
quantities and the homogeneity of the product. The preparation of hybridoma 
cell lines for monoclonal antibody production derived by fusing an immortal 
cell line and lymphocytes sensitized against the immunogenic preparation 
can be done by techniques which are well known to those who are skilled in 
the art. 

The present invention contemplates a method for detecting a 
MUC polypeptide in a protein extract obtained from a mammal, said method 
including the step of forming a complex between an anti-MUC antibody and 
a MUC polypeptide, and then detecting said complex. 

The presence of a MUC polypeptide may be determined in a 
number of ways such as by Western blotting and ELISA procedures. A wide 
range of immunoassay techniques are available as can be seen by reference 
to U.S. Patent Nos. 4,016,043, 4,424,279 and 4,018,653. These include both 
single-site and two-site or "sandwich" assays of the non-competitive types, 
as well as in the traditional competitive binding assays. These assays also 
include direct binding of a labelled antibody to a target. 

Sandwich assays are among the most useful and commonly 
used assays and are favoured for use in the present invention. A number of 
variations of the sandwich assay technique exist and all are intended to be 
encompassed by the present invention. Briefly, in a typical forward assay, 
an unlabeled antibody is immobilized to a solid substrate and the sample to 
be tested brought into contact with the bound molecule. After a suitable 
period of incubation, for a period of time sufficient to allow formation of an 
antibody-antigen complex, a second antibody specific to the antigen, labelled 
with a reporter molecule capable of producing a detectable signal is then 
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added and incubated, allowing time sufficient for the formation of another 
complex of antibody-antigen-labelled antibody. Any unreacted material is 
washed away and the presence of the antigen is determined by measurement 
of a signal produced by the reporter molecule. The results may either be 
5 qualitative, by simple observation of the visible signal, or may be quantitated 
by comparing with a control sample containing known amounts of hapten. 
Variations on the forward assay include a simultaneous assay, in which both 
sample and labelled antibody are added simultaneously to the bound 
antibody. These techniques are well known to those skilled in the art, 
1 0 including any minor variations as will be readily apparent. In accordance with 
the present invention the protein extract might be a cell extract, tissue biopsy 
or possibly serum, saliva, mucosal secretions, lymph, tissue fluid and 
gastrointestinal fluid. The extract is, therefore, generally a biological sample. 

In the typical forward sandwich assay, a first antibody having 
1 5 specificity for MUC or antigenic parts thereof, is either covalently or passively 
bound to a solid surface. The solid surface is typically glass or a polymer, 
the most commonly used polymers being cellulose, polyacrylamide, nylon, 
polystyrene, polyvinyl chloride or polypropylene. The solid supports may be 
in the form of tubes, beads, discs of microplates, or any other surface 
20 suitable for conducting an immunoassay. The binding processes are well- 
known in the art and generally consist of cross-linking covalently binding or 
physically adsorbing, the polymer-antibody complex is washed in preparation 
for the test sample. An aliquot of the sample to be tested is then added to 
the solid phase complex and incubated for a period of time sufficient (e.g. 2- 
25 40 minutes or overnight if more convenient) and under suitable conditions 
(e.g. from 4°C to 37°C) to allow binding of any subunit present in the 
antibody. Following the incubation period, the solid phase complex is 
washed and dried and incubated with a second antibody which is specific for 
a portion of the antigen (i.e. MUC). The second antibody is linked to a 
30 reporter molecule which is used to indicate the binding of the second 
antibody to MUC. 
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An alternative method involves immobilizing the target 
molecules in the biological sample and then exposing the immobilized target 
to specific antibody which may or may not be labelled with a reporter 
molecule. Depending on the amount of target and the strength of the 
5 reporter molecule signal, a bound target may be detectable by direct labelling 
with the antibody. Alternatively, a second labelled antibody, specific to the 
first antibody is exposed to the target-first antibody complex to form a target- 
first antibody-second antibody tertiary complex. The complex is detected by 
the signal emitted by the reporter molecule. 
-JO By "reporter molecule" as used in the present specification, is 

meant a molecule which, by its chemical nature, provides an analytically 
identifiable signal which allows the detection of antigen-bound antibody. 
Detection may be either qualitative or quantitative. The most commonly used 
reporter molecules in this type of assay are either enzymes, fluorochromes 
15 or radionuclide containing molecules (i.e. radioisotopes) and 
chemiluminescent molecules. 

In the case of an enzyme immunoassay, an enzyme is 
conjugated to the second antibody, such as via glutaraldehyde or periodate 
amongst other means. As will be readily recognized, however, a wide variety 
20 of different conjugation techniques exist, which are readily available to the 
skilled artisan. Commonly used enzymes include horseradish peroxidase, 
glucose oxidase, beta-galactosidase and alkaline phosphatase, amongst 
others. The substrates to be used with the specific enzymes are generally 
chosen for the production, upon hydrolysis by the corresponding enzyme, of 
25 a detectable colour change. It is also possible to employ fluorogenic 
substrates, which yield a fluorescent product rather than the chromogenic 
substrates noted above. In all cases, the enzyme-labelled antibody is added 
to the first antibody-antigen complex, allowed to bind, and then the excess 
reagent is washed away. A solution containing the appropriate substrate is 
30 then added to the complex of antibody-antigen-antibody. The substrate will 
react with the enzyme linked to the second antibody, giving a qualitative 
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visual signal, which may be further quantitated, usually 
spectrophotometrically, to give an indication of the amount of antigen which 
was present in the sample. The term "reporter molecule" also extends to use 
of cell agglutination or inhibition of agglutination such as red blood cells on 
5 latex beads, and the like. 

Also, fluorescent compounds, such as fluorescein and 
rhodamine, may be chemically coupled to antibodies without altering their 
binding capacity. When activated by illumination with light of a particular 
wavelength, the fluorochrome-labelled antibody adsorbs the light energy, 
10 inducing a state of excitability in the molecule, followed by emission of the 
light at a characteristic colour visually detectable with a light microscope. As 
in the EIA, the fluorescent labelled antibody is allowed to bind to the first 
antibody-hapten complex. After washing off the unbound reagent, the 
remaining tertiary complex is then exposed to light of the appropriate 
15 wavelength and the fluorescence observed indicates the presence of the 
antigen of interest. Immunofluorescene and EIA techniques are both very 
well established in the art. Other reporter molecules, such as radioisotope, 
chemiluminescent or bioluminescent molecules, may also be employed. 

The MUC genes of the present invention are likely to function 
20 in cell adhesion, signal transduction, growth regulation, epithelial cell 
protection and/or immunological reactions. The classical gel-forming mucins 
function in protecting and lubricating epithelial tissues (particularly those of 
the respiratory and gastrointestinal tracts) by forming a layer of viscoelastic 
gel. These new mucins, MUCH and MUC12, show structural similarity to 
25 MUC1. MUC1 can be secreted, but unlike the classical mucins, it is primarily 
a type I transmembrane protein that interacts and complexes with other 
adhesion molecules, and is involved in signal transduction. MUC12 has an 
EGF growth factor-like domain, is likely to be a transmembrane protein and 
has a putative tyrosine phosphorylation site that may participate in 
30 intracellular signalling. It is hypothesised that loss of MUC12 may be 
associated with poor prognosis in CRC. 
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The isolated MUC nucleic acids of the present invention are, 
therefore, considered in one embodiment, to correspond to cancer 
suppressor genes. Suppression may mean total inhibition of any 
development of large bowel cancer or a limitation of the severity of or an 
5 amelioration of the condition resulting from a large bowel cancer. The MUC 
nucleic acids of the present invention are also considered in another 
embodiment to be capable of modulating disease conditions such as CRC, 
BC, IBD, CF, asthma, chronic bronchitis, ulcerative colitis and/or Crohn's 
disease 

10 Cystic fibrosis (CF) is an inherited disease of epithelial cell 

chloride ion transport that affects multiple organ systems. It is the most 
common cause of severe, progressive lung disease and exocrine pancreatic 
insufficiency in childhood. The cystic fibrosis transmembrane conductance 
regulator (CFTR) gene located on chromosome 7q22 encodes a large single 
15 chain protein that forms a chloride channel. Virtually all of the morbidity and 
mortality associated with mutations in the CFTR gene causing cystic fibrosis 
arise from respiratory disease due to chronic infection and mucus 
obstruction. The precise mechanism of mucus accumulation in cystic fibrosis 
is controversial. Data suggest that CFTR malfunction may trigger mucin 
20 secretion and alter mucus properties, and/or bacterial infection triggers the 
hypersecretion of mucin in CF patients. The gene of the present invention 
is expressed in the colon, pancreas, small intestine, and lung, all tissues 
where mucus obstruction occurs. Accordingly, aberrant expression of the 
genes may contribute to cystic fibrosis. 
25 Aberrant mucin expression is also a recognised component of 

IBD. Inflammatory bowel disease is characterised by considerable alterations 
in glycosylation, sialyation and sulphation of glycoproteins. It is unclear 
whether the changes in mucus production are a cause or response to the 
disease. Susceptibility genes for inflammatory bowel disease have been 
30 localised to chromosomes 3, 12 and 7q22. Accordingly, the MUC genes of 
the present invention are considered candidates for susceptibility genes for 
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IBD. Up or down regulation, or altered secretion of one of these mucins may 
influence the quality of colonic mucus and therefore the pathology of these 
diseases. Certain inherited forms of these genes may indicate a 
predisposition to IBD. 
5 The identification of MUC genes and isolated MUC nucleic 

acids permits the generation of a range of therapeutic methods and 
compositions. Such therapeutics may modulate MUC gene expression and 
the activity of MUC polypeptides. Modulators contemplated by the present 
invention includes agonists and antagonists of MUC gene expression. 
10 Antagonists of MUC gene expression include antisense molecules, 
ribozymes and co-suppression molecules. Agonists include molecules which 
increase promoter activity or interfere with negative mechanisms. Agonists 
of MUC include molecules which overcome any negative regulatory 
mechanism. Antagonists of MUC polypeptides include antibodies and 
1 5 inhibitor peptide fragments. Another class of therapeutics may be designed 
to mimic or block intracellular signal transduction by MUC polypeptides. 

In accordance with the present invention, it is proposed that 
MUC functions as a suppressor of cancer development in the large bowel. 
Hereditary cancers arise with loss of the wild-type gene. In addition, 
20 germline mutations underlying large bowel cancer are inactivated for the 
MUC genes and, therefore, hereditary cancers have no functional copy of the 
gene. Furthermore, sporadic large bowel cancers arise with somatic loss of 
both copies of the gene. The present invention extends to the use of 
modulating levels of expression of MUC genes or their translation products 
25 in the context of cancers related thereto. 

Thus, the present invention contemplates a method of gene 
therapy of a mammal. Such a method utilizes a gene therapy construct which 
includes an isolated MUC nucleic acid ligated into a gene therapy vector 
which provides one or more regulatory sequences that direct expression of 
30 said nucleic acid in said mammal. 

Such regulatory sequences may include a promoter, an 
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enhancer, a polyadenylation sequence, splice donor/acceptor sequences 
and translation termination and intiation sequences. 

Typically, gene therapy vectors are derived from viral DNA 
sequences such as adenovirus, adeno-associated viruses, herpes-simplex 
5 viruses and retroviruses. Suitable gene therapy vectors currently available 
to the skilled person may be found in Robbins et a/., 1998, Trends 
Biotechnol. 16 35, for example, which is herein incorporated by reference. 

If "anti-sense" therapy is contemplated, then one or more 
selected portions of a MUC nucleic acid may be oriented 3'-45' in the gene 

1 0 therapy vector. 

Administration of the gene therapy construct to said mammal, 
preferably a human, may include delivery via direct oral intake, systemic 
injection, or delivery to selected tissue(s) or cells, or indirectly via delivery to 
cells isolated from the mammal or a compatible donor. An example of the 
15 latter approach would be stem-cell therapy, wherein isolated stem cells 
having potential for growth and differentiation are transfected with the vector 
comprising the MUC nucleic acid. The stem-cells are cultured for a period 
and then transferred to the mammal being treated. 

Delivery of said gene therapy construct to cells or tissues of 
20 said mammal or said compatible donor may be facilitated by microprojectile 
bombardment, liposome mediated transfection (e.g. lipofectin or 
lipofectamine), electroporation, calcium phosphate or DEAE-dextran- 
mediated transfection, for example. A discussion of suitable delivery methods 
may be found in Chapter 9 of CURRENT PROTOCOLS IN MOLECULAR 
25 BIOLOGY (Eds. Ausubel et a/.; John Wiley & Sons Inc., 1997 Edition), for 
example, which is herein incorporated by reference. 

For example, a MUC nucleic acid may be introduced into a cell 
to enhance the ability of that cell to survive, conversely, MUC antisense 
sequences such as 3'-» 5' oligonucleotides may be introduced to decrease 
30 the survival capacity of any cell expressing an endogenous MUC gene. 

In this regard, increased MUC expression or activity is 
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important in conditions of repressing cancer growth and/or development. 
Decreased MUC expression or activity may be important, for example, in the 
treatment of cystic fibrosis or the treatment of inflammatory bowel disease. 

Accordingly, the present invention contemplates a 
pharmaceutical composition comprising a MUC polypeptide or a derivative 
thereof or a modulator of MUC gene expression or activity, inclusive of anti- 
MUC antibodies. These components are referred to herein as the "active 
ingredients", and are suitably provided in combination with one or more 
pharmaceutically-acceptable carriers and/or diluents. 

The pharmaceutical forms suitable for injectable use include 
sterile aqueous solutions (where water soluble) and sterile powders for the 
extemporaneous preparation of sterile injectable solutions. It must be stable 
under the conditions of manufacture and storage and must be preserved 
against the contaminating action of microorganisms such as bacteria and 
fungi. The carrier can be a solvent or, for example, water, ethanol, polyol (for 
example, glycerol, propylene glycol and liquid polyethylene glycol, and the 
like) or suitable mixtures thereof as well as vegetable oils. The preventions 
of the action of microorganisms can be brought about by various antibacterial 
and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic 
acid, thirmersal and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars or sodium chloride. Prolonged 
absorption of the injectable compositions can be brought about by the use in 
the compositions of agents delaying absorption, for example, aluminum 

monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the 
active compounds in the required amount in the appropriate solvent with 
various of the other ingredients enumerated above, as required, followed by 
filtered sterilization. In the case of sterile powders for the preparation of 
sterile injectable solutions, the preferred methods of preparation are vacuum 
drying and the freeze-drying technique which yield a powder of the active 
ingredient plus any additional desired ingredient from previously 
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sterile-filtered solution thereof. 

When the active ingredients are suitably protected they may be 
orally administered, for example, with an inert diluent or with an assimilable 
edible carrier, or it may be enclosed in hard or soft shell gelatin capsule, or 
5 it may be compressed into tablets, or it may be incorporated directly with the 
food of the diet. For oral therapeutic administration, the active compound 
may be incorporated with excipients and used in the form of ingestible 
tablets, buccal tablets, troches, capsules, elixirs, suspensions, syrups, 
wafers, and the like. Such compositions and preparations should contain at 
1 0 least 1 % by weight of active compound. The percentage of the compositions 
and preparations may, of course, be varied and may conveniently be 
between about 5 to about 80% of the weight of the unit. The amount of 
active compound in such therapeutically useful compositions in such that a 
suitable dosage will be obtained. Preferred compositions or preparations 
1 5 according to the present invention are prepared so that an oral dosage unit 
form contains between about 0.1 ug and 2000 mg of active ingredient. 

The tablets, troches, pills, capsules and the like may also 
contain the components as listed hereafter: a binder such as gum, acacia, 
corn starch or gelatin; excipients such as dicalcium phosphate; a 
20 disintegrating agent such as corn starch, potato starch, alginic acid and the 
like; a lubricant such as magnesium stearate; and a sweetening agent such 
a sucrose, lactose or saccharin may be added or a flavouring agent such as 
peppermint, oil of wintergreen, or cherry flavouring. When the dosage unit 
form is a capsule, it may contain, in addition to materials of the above type, 
25 a liquid carrier. Various other materials may be present as coatings or to 
otherwise modify the physical form of the dosage unit. For instance, tablets, 
pills, or capsules may be coated with shellac, sugar or both. A syrup or elixir 
may contain the active compound, sucrose as a sweetening agent, methyl 
and propylparabens as preservatives, a dye and flavouring such as cherry 
30 or orange flavour. Of course, any material used in preparing any dosage unit 
form should be pharmaceutical^ pure and substantially non-toxic in the 
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amounts employed. In addition, the active compound(s) may be incorporated 
into sustained-release preparations and formulations. 

Pharmaceutically acceptable carriers and/or diluents include 
any and all solvents, dispersion media, coatings, antibacterial and antifungal 
5 agents, isotonic and absorption delaying agents and the like. The use of 
such media and agents for pharmaceutically active substances is well known 
in the art. Except insofar as any conventional media or agent is incompatible 
with the active ingredient, use thereof in the therapeutic compositions is 
contemplated. Supplementary active ingredients can also be incorporated 
10 into the compositions. 

It is especially advantageous to formulate parenteral 
compositions in dosage unit form for ease of administration and uniformity of 
dosage. Dosage unit form as used herein refers to physically discrete units 
suited as unitary dosages for the mammalian subjects to be treated; each 
15 unit containing a predetermined quantity of active material calculated to 
produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the novel dosage unit forms of 
the invention are dictated by and directly dependent on (a) the unique 
characteristics of the active material and the particular therapeutic effect to 
20 be achieved, and (b) the limitations inherent in the art of compounding such 
an active material for the treatment of disease in living subjects having a 
diseased condition in which bodily health is impaired. 

The principal active ingredient is compounded for convenient 
and effective administration in effective amounts with a suitable 
25 pharmaceutically acceptable carrier in dosage unit form as hereinbefore 
disclosed. A unit dosage form can, for example, contain the principal active 
compound in amounts ranging from 0.5 ug to about 2000 mg. Expressed in 
proportions, the active compound is generally present in from about 0.5 ug 
to about 2000 mg/mL of carrier. In the case of compositions containing 
30 supplementary active ingredients, the dosages are determined by reference 
to the usual dose and manner of administration of the said ingredients. It is 
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also convenient to represent the effective amounts of active ingredients as 
an amount per kg body weight. For example, the present invention 
encompasses effective amounts for 0.005 ug/kg body weight at 2000 mg/kg 
body weight. 

5 The pharmaceutical composition may also comprise genetic 

molecules such as a vector capable of transfecting target cells where the 
vector carries a nucleic acid molecule capable of modulating MUC gene 
expression or MUC polypeptide activity. The vector may, for example, be a 
viral vector. 

1 o From the foregoing, it is apparant that therapeutic methods and 

compositions of the invention are useful in the treatment of disease 
conditions associated with aberrant Mucin expression, altered properties of 
mucus or epithelial inflammatory processes involving Mucins. 

Preferably, the disease condition is selected from the group 

15 consisting of colorectal cancer (CRC), cystic fibrosis (CF), inflammatory 
bowel disease (IBD), breast cancer (BC), Crohn's disease, ulcerative colitis, 
asthma and chronic bronchitis. 

More preferably, the disease condition is selected from the 
group consisting of colorectal cancer (CRC), cystic fibrosis (CF), 

20 inflammatory bowel disease (IBD) and breast cancer (BC). although not 
limited thereto. The therapeutic methods of the invention may therefore be 
used to alleviate one or more symptoms of diseases or be used as 
prophylactic treatments to prevent, or reduce the likelihood of, said symptoms 
from occurring. 

25 The present invention is further described by the following non- 

limiting Examples. 

F X A M P L E S 
EXAMPLE 1: Tissue Specimens 

Tissue specimens were collected from patients undergoing 
30 surgery (Dukes' A n=5; Dukes' B n=5, Dukes C n=5, Dukes' D n=5). Colonic 
specimens were obtained from patients undergoing either colectomy or 
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partial hepatectomy for colorectal carcinoma. Samples of normal colonic 
mucosa, primary colon cancer, liver metastases (if present) and adjacent 
normal liver were rapidly excised from operative specimens, snap-frozen in 
liquid nitrogen and stored at -70°C until use. Care was taken to exclude 
5 normal mucosal tissue from tumour samples. Junctional tissue specimens 
from four tumours of each Dukes' stage were randomly selected for in situ 
hybridisation. Tissues were fixed for 24-48 hours in 10% v/v buffered 
formalin, dehydrated in ethanol, cleaned in chloroform and embedded in 
parraffin wax. Biopsy specimens of normal colonic epithelium from four 
10 distinct regions of the colon were collected via colonoscopy from each of 
three healthy individuals undergoing routine colonoscopic screening. 
Similarly, intestinal biopsies were obtained via colonoscopy from ten patients 
with inflammatory bowel disease. Specimens were snap-frozen and stored 
at -70° C until RNA was extracted as per Example 3 below. 
15 EXAMPLE 2 : Cell Lines and Culture 

Seven human colonic tumour lines were obtained: LIM1215, 
LIM2405, LIM1863, LIM1899 (Ludwig Institute, Melbourne, Australia), HT29 
(ATCC HTB38), SW480 (ATCC CCL 228) and SW620 (ATCC CCL 227). 
LIM1215 and SW620 are each derived from CRC metastases. Cell lines were 
20 maintained in RPM1 1640 with 10% v/v fetal calf serum, 2 mM glutamate, 25 
mM HEPES, 60 mg/ml penicillin G and 100 mg/ml streptomycin sulfate and 
incubated in 5% v/v C0 2 and 95% v/v air at 37°C. Cultures were passaged 
twice weekly using standard techniques. The following breast carcinoma 
lines were included in this study: KPL-1 (a gift of Dr Junichi Kurebayashi, 
25 Suzuki, Japan), MA11 (a gift of Dr Philip Rye, Oslo, Norway), BT 20, 
DU4475, MCF-7, MDA-MB-453, SK-Br-3, T47D, UACC-893, ZR-75-1 and 
ZR-75-30 (ATCC, Rockville, MD), and MDA-MB-435 and MDA-MB-468 (a gift 
of Dr. Janet Price, MD Anderson Cancer Center, Houston, TX). All breast 
cancer cell cultures were maintained in RPMI-1640 medium supplemented 
30 with 1 0% fetal calf serum and 0.006% penicillin and 0.01 % streptomycin with 
the following exceptions: DU-4475 in RPMI-1640 with 20% FCS, KPL-1 was 
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maintained in DMEM with 5% FCS, MA11 in 1:1 Ham's F12:RPMI-1640 with 
10% FCS, SK-Br-3 in McCoy's medium with 15% FCS, and UACC-893 in 
RPMI-1640 with 15% FCS. 
EXAMPLE 3 : RNA Extraction 

Total RNA was isolated by the method of Chomczynski and 
Sacchi (Chomczynski et a/., 1987, Anal. Biochem. 162 156). Cells were 
resuspended in RNA extraction buffer (4 M guanidinium isothiocyanate 
containing 25 mM sodium citrate, pH7.0, 0.5 % w/v sodium lauroyl sarcosine 
(SLS) and 0.1 M 2-mercaptoethanol). Tissue samples were homogenised in 
RNA extraction buffer. Extracted RNA was dissolved in RNase-free water and 
the concentration and purity determined by spectrophotometry at 260 and 
280 nm (Sambrook et a/., Molecular Cloning, A Laboratory Manual. 2nd Ed. 
Cold Spring Harbour Laboratory Press. Cold Spring Harbour, NY, 1989). The 
integrity of the RNA was assessed by denaturing agarose gel electrophoresis 
and samples transferred to HYBOND N (Amersham, Bucks, England) 
membrane by capillary blotting. 
EXAMPLE 4 : DA/A Sequencing 

Approximately 500 ng of DNA were employed in a cycle 
sequencing reaction with 2.5 pmol of primer and 4 ul of Dye terminator or 
dRhodamine reaction mix (DNA Cycle Sequencing Kits, Perkin Elmer, 
Norwalk, CT,) in a total volume of 10 pi. Reaction mixes contained Amplitaq 
DNA polymerase, dNTPs and fluorescently labelled dideoxynucleotides (dye 
terminators). Cycling reactions were as follows: 25 cycles of denaturation at 
96°C (30 s), primer annealing at 50°C (15 s) and extension at 60°C (4 min). 
Unincorporated nucleotides were removed by ethanol precipitation. The 
reactions were analysed on a Model 373A automated DNA sequencer 
(Applied Biosystems) run by technical staff in the core sequencing facility of 
the Queensland Institute of Medical Research. 

EXAMPLE 5 : Identification bv Differential Display of Two cDNAs 

Encoding Mucins Downrepulated in Colorectal 



cancer 
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The differential display method was devised from the original 
technique described by Liang & Pardee, 1992, Science 257 967. Total RNA 
was isolated by the method as described previously. Reverse transcription 
was carried out using one of four anchored primers, T 12 MG, T 12 MC, T 12 MA 
and T 12 MT (Operon Technologies Inc., Alameda, CA) and Superscript RNAse 
H- reverse transcriptase (Gibco BRL, Gaithersburg, MD). One arbitrary 
10mer primer (Operon Technologies Inc.) was selected at random to be 
employed in a PCR with the appropriate anchored primer. Two patients, 101 
and 112, were analyzed simultaneously and duplicates of two separate 
reverse transcription reactions electrophoresed on each gel. Gels were put 
down wet and autoradiographed for 1-3 days. DNA was removed from gel 
slices by boiling and reamplified by PCR. Bands were then cloned into 
pGEM-T (Promega Corporation, Madison, Wl) and sequenced. Sequences 
were analysed by multiple sequence similarity searches using BLAST 
algorithms (Altshcul ef a/., 1990, supra) accessed through the National 
Centre of Biotechnology Information (NCBI; http://www.ncbi.nlm.nih.gov). 

Differential display was performed on RNA from paired normal 
colonic mucosa and primary colorectal cancers. Using a PCR primer 
combination of T 12 MG and 10mer S'-ACTTCGCCAC-S' (SEQ ID NO:7), 
bands dd29 (MUC12) and dd34 (MUC11) were both amplified from normal 
colonic mucosal RNA of two patients and were consistently downregulated 
in the tumors from these patients in multiple PCR reactions (FIG. 1A). 
Following reamplification PCR, discrete bands of approximately 720 bp for 
dd29 and 530 bp for dd34 were isolated and cloned into pGEM-T. Sequence 
analysis showed that both cDNAs were novel, with no match in any database 
accessed through the NCBI. Repetitive segments typical of mucin tandem 
repeats were observed in dd34. 
EXAMPLE 6 : Northern Blot Analysis 

Northern blot analysis was performed on paired normal and 
tumor total RNA extracted from the same patients employed in the differential 
display experiment. dd29 (MUC12) and dd34 (MUC11) were random primer- 
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labeled using a Megaprime DNA labeling system (Amersham, Aylesbury, UK) 
and hybridization performed at 65°C in buffer containing 7% SDS, 0.26 M 
Na 2 HP0 4 , 1 mM EDTA, 1% BSA. 

Northern blot analyses of dd29 (FIG. 1 B) and dd34 (FIG. 1C) 

5 with colonic total RNA used for the differential display reactions revealed a 
polydisperse signal beginning near the top of the gel for RNA isolated from 
normal colonic mucosa and no signal in tumor-derived RNA. Probe dd29 
showed some cross-hybridization to ribosomal RNA. Polydispersity of signal 
is a hallmark of mucin RNA blots due to shearing of very high molecular 

10 weight transcripts. 

EXAMPLE 7 : Multiplex Semi-auantitative RT-PCR 

Multiplex semi-quantitative RT-PCR was performed on total 
RNA isolated from six colorectal cancer cell lines and from paired normal 
colonic mucosa and tumor colorectal cancer tissues from 20 patients, five of 

15 each Dukes' stage. Informed consent was obtained from each subject after 
approval by the appropriate hospital Ethics Committee. PCR products were 
quantitated relative to a [^-microglobulin cDNA amplification control using 
densitometry. First strand cDNA synthesis was accomplished using 1 ug of 
total RNA. PCR amplification of cDNA was performed in a total volume of 25 

20 pi containing 1 ul of the first strand cDNA synthesis reaction products, 2.5 ul 
10x Tag polymerase buffer (25 mM TAPS (tris-[hydroxymethyl]-methyl-amino- 
propane-sulfonic acid, sodium salt) pH 9.3, 50 mM KCI), 2 mM dNTPs, 25 mM 
MgCI 2 , 20 pmol each of the forward and reverse primers, and 2.5 U Taq 
polymerase. Gene-specific forward and reverse primers for MUC12 and 

25 MUCH were designed to produce PCR products of 510 bp and 169 bp 
respectively. Primers for p 2 -microglobulin generated a PCR product of 247 
bp (Gussowef a/., 1987, J. Immunol. 139 3132). Primers were: 
MUC12F1; 5'-TGAAGGGCGACAATCTTCCTC-3' (SEQ ID NO:8); 
MUC12R1; 5'-TACACGAGGCTCTTGGCGATGTTG-3' (SEQ ID NO:9); 

30 MUC1 1F1; 5'-CAGGCGTCAGTCAGGAATCTACAG-3' (SEQ ID NO:10); 

MUC11R1; 5'-GAGGCTGTGGTGTTGTCAGGTAAG-3' (SEQ ID NO: 11); 
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P-21F; 5'-TGAATTGCTATGTGTCTGGGT-3' (SEQ ID NO:12); 
(3-21 R; 5'-C CTC C ATG ATG CTG CTTAC AT-3' (SEQ ID NO: 13); 
MUC12TOTF1;5'-AGCCAACCAGGCTCAGCTCT-3' (SEQ ID NO:14); and 
MUC12TOTR1;5'-GCTCACACAGTGGATGCTACC-3' (SEQ ID NO: 15). 

After an initial denaturation step of 94°C for 5 minutes, the 
amplification conditions were: 21 cycles of denaturation at 94°C (30 s) for 
MUC12, (24 cycles of denaturation at 94°C (30 s) for MUC1 1), annealing at 
60° C (30 s) and extension at 72°C (30 s). PCR products were 
electrophoresed on 1.2% 1x TBE gels and photographed. 

Due to the polydisperse signals obtained by Northern analysis, 
expression of MUC11 and MUC12 was examined in a range of colorectal 
cancer cell lines and tissue mRNAs by multiplex semi-quantitative RT-PCR. 
dd29 was not expressed in any of six colorectal cancer cell lines examined 
(FIG. 1D). In contrast, dd34 showed a different pattern of expression, with 
HT29, LIM1215, LIM1899, LIM1863 lines revealing very faint PCR products, 
and SW620 and SW480 lines showing relatively high levels of expression 
(FIG.1E). For tumor tissue-derived RNA, downregulation was defined as 
amplified band intensity less than 30% of that observed from paired normal 
colon tissue. dd29 was found to be downregulated or absent in 6/15 (40%) 
tumors with paired normal samples, and at low levels in 3/5 (60%) Dukes' 
stage D samples (where normal colon was not available for comparison) 
(FIG.1F). dd34 was downregulated in the tumors of 12/15 (80%) paired 
samples and expressed at low levels in 4/5 (80%) Dukes' stage D samples. 
One of five Dukes' stage D samples showed relatively high levels of 
expression of dd34 (FIG. 1G). Significantly, 13/15(87%) colorectal cancers 
showed downregulation of at least one of these mucin genes, with 5/15 
(33%) showing downregulation of both genes. 

EXAMPLE 8 : Differential Tissue Dist ribution of MUC11. MUC12 

and MUC3 mRNAs 

A human RNA "master blot" (Clontech, Palo Alto, CA, catalogue 
number 7770-1 ) with RNA from 50 different tissues and controls was used to 
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examine mucin gene expression. DNA fragments encoding dd29, dd34 and 
MUC3 (Genbank Accession No. M55405, a gift from Dr. Sandra Gendler, 
Mayo Clinic, Scotsdale, Arizona) were excised from vector and radiolabeled 
as described above. Hybridization was performed as per the manufacturer's 

5 instructions. The master blot was reprobed with a radiolabeled (3-actin cDNA 
as a loading control. 

Analysis of the tissue distribution of MUC11, MUC12 and MUC3 
transcripts in RNA isolated from 50 different normal tissues showed a distinct 
pattern of expression for each gene (FIG. 5). MUC12 and MUC11 showed 

1 0 highest expression in colon but had different patterns in other organs, mainly 
restricted to those of epithelial type. MUC11 had a wider epithelial 
distribution than MUC12 which was restricted to expression in the colon, and 
weakly in the pancreas, prostate and uterus. Consistent with published 
findings (Van Klinken ef a/., 1997, Biochem. Biophys. Res. Comm. 238 143), 

15 MUC3 was found to be predominantly expressed in the small intestine and 
at much lower levels in the colon. Interestingly, it was also present in the 
thymus. 

EXAMPLE 9 : Extending the Sequenc es ofdd29 and dd34 

The strategy employed in the cloning oi.MUCH and MUC12 is 
20 shown in FIGS. 8A and 8B respectively. 
9.1 l ihrarv Screening . 

A Xgt11 human fetal brain 5'-STRETCH PLUS cDNA library 
(Clontech, Palo Alto, CA) was screened using radiolabeled dd29 and dd34. 
X DNA was extracted and inserts were excised, cloned into pBSK- and 

25 sequenced. 

9 2 PGR to extend the sequence ofdd34 b v linking clones 2 and H5 

Screening of the fetal brain library with clone dd34, yielded two 

new cDNA clones: clone 2 (1043 bp) and clone Ii5 (1045 bp). Clone dd34 

was a perfect match to the middle of the larger clone 2. cDNA from clone Ii5, 
30 however, was highly homologous but not identical to the cDNA from clone 

dd34. To ascertain whether these partial cDNAs arose from a single mRNA 
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transcript, RT-PCR was carried out using combinations of forward and 
reverse primers specific for each cDNA in an attempt to link them. RT-PCR 
was performed on total RNA extracted from normal colon in a stringent 
touchdown PCR using high fidelity DyNAzyme DNA polymerase (Finnzymes, 
Espoo, Finland). Primer combination MUC11F1 and li5R (5'- 
G G GAACACTGTGGTTTC AGTTG AG-3' ; SEQ ID NO:16) yielded a PCR 
product of 2 kb demonstrating that these two cDNAs were derived from a 
single transcript. This product was cloned into pGEM-T and sequenced. 
9.3 PCR library screening - to extend sequence ofdd29 

Forward and reverse primers for dd29 (dd29F1 and dd29R1) 
were used in combination with a T7 vector-derived primer in a stringent 
touchdown PCR to screen an ulcerative colitis (UC) plasmid library (a gift 
from Dr. Jonathon Fawcett, Queensland Institute of Medical Research, 
Brisbane, Australia). Amplified products were purified, cloned into pGEM-T 
and sequenced. 

EXAMPLE 10 : Sequence Analysis ofdd29 (MUC12) 

The sequence of dd29 revealed that it was amplified as a result 
of priming of random 10mer at both ends of the PCR product and that it did 
not contain a 3' untranslated region (3'-UTR) or poly A tail. Screening of an 
UC cDNA library with dd29-specific primers extended the sequence 840 bp 
in the 5' direction and 800 bp in the 3' direction to the poly A tail (Genbank 
Accession Number AF1 47790). To confirm contiguous cDNA sequence, 
primers MUC12TOTF1 and MUC12TOTR1 were designed to produce an 
expected PCR product of 1532 bp; primers corresponded to bases 230-250 
and 1742-1762, respectively, in SEQ ID NO:6. In a stringent touchdown PCR 
amplification procedure an intense discrete product of the expected size was 
identified from normal colonic cDNA and cDNA from the Caco-2 colonic 
cancer cell line. This reaction confirmed the reported MUC12 cDNA 
sequence. 

Conceptual translation of the composite MUC12 cDNA reveals 
the presence of serine/threonine and proline-rich degenerate tandem repeats 
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(FIG. 2) consistent with this protein being a member of the epithelial mucin 
family. The deduced 28 amino acid tandem repeat structure is shown in FIG. 
2. Following the mucin-repeat domain, MUC12 contains two cysteine-rich 
EGF-like domains separated by a 150 amino acid non-mucin-like sequence 
5 (amino acids 261-410) containing five N-glycosylation sites and a potential 
coiled-coil domain. The second cysteine-rich EGF-like domain is immediately 
followed by a putative transmembrane domain containing 26 hydrophobic or 
uncharged amino acids, and a cytoplasmic tail of 75 amino acids at the 
carboxyl terminus. 

10 Sequence alignment of MUC12, human MUC3 (hMUC3), rat 

Muc3 (rMuc3), mouse Muc3 (mMuc3), human MUC4 (hMUC4) and rMuc4 is 
shown in FIG. 3. When aligned by the transmembrane amino acid 
sequences, MUC12 was found to have areas of significant homology to 
rMuc3, mMuc3 and hMUC3, including perfect conservation of eight cysteine 

1 5 residues in the second EGF-like domain. With inclusion of three small gaps, 
each of these cysteines also align with those in rat and human MUC4. 
Interestingly, all six mucins contain a conserved EGF-like sequence of 
Cx(5)GPxCxCx(9)GExC. Furthermore, there is some (4 out of 8) conservation 
of the cysteine residues between MUC12 and the human and rodent MUC3 

20 and MUC4 mucins in the first EGF-like domain. 

EXAMPLE 11 : Sequence Analysis o fdd34 (MUC11) 

Clone dd34 (544 bp) was also obtained as a result of priming 
of random 10mers at both ends of the PCR product. Screening of a Xgt11 
human fetal brain library yielded two positive plaques which hybridized to 

25 dd34, clone Ii5 (1045 bp) and clone 2 (1043 bp). These two clones 
represented opposite ends of a 2.8 kb partial MUC11 cDNA sequence 
(Genbank Accession Number AF1 47791), the linking of which was 
established by PCR (see Methods). Conceptual translation of the MUC11 
composite is shown in FIG. 4. The entire 957 amino acid sequence consisted 

30 of serine, threonine and proline-rich tandem repeats of 28 amino acids in 
length, consistent with it being derived from a large epithelial mucin. The 
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deduced tandem repeat structure and consensus repeat sequence for 
MUC11 is shown in FIG. 4. 

EXAMPLE 12 : Chromosomal Localization ofMUC11 and MUC12 

DNA fragments excised from dd29 (720 bp) and dd34 (530 bp) 
5 were nick translated with biotin-14-dATP and hybridized in situ at a final 
concentration of 10 ng/ul to metaphases from two normal males. The 
fluorescence in situ hybridization (FISH) method was modified from that 
previously described (Callen et ai, 1990, Ann. Genet. 33 219) in that 
chromosomes were stained before analysis with both propidium iodide as 
1 0 counterstain and DAPI for chromosome identification. Images of metaphase 
preparations were captured by a cooled CCD camera using the CyroVision 
Ultra image collection and enhancement system (Applied Imaging Int Ltd, 

Sunderland, U.K.). 

Twenty metaphases from a normal male were examined for 
15 hybridization to dd29 and dd34 probes. For both genes, all of the 
metaphases showed strong signal on one or both chromatids of chromosome 
7, at band 7q22 (data not shown). A similar result was obtained using 
metaphases from a second normal male. 

EXAMPLE 13. Production of monoclonal anti bodies reactive with 
20 MUC11 and 12 

The following peptides were conjugated to keyhole limpet 
haemocyanin (KLH) with the heterobifunctional cross-linking agent m- 
maleimidobenzoyl-N-hydroxysuccinimide ester using standard techniques 
(Harlow, E. & Lane, D. Antibodies: A Laboratory Manual, Cold Spring 
25 Harbor, Cold Spring Harbor Laboratory, 1 988, which is herein incorporated 
by reference): - 

MUC1 1 : CFHSRPASTHTTLFTED (SEQ ID NO: 1 7); corresponding to 
part of the degenerate tandem repeat region, specifically amino 
acid residues 690-705 deduced from the partial cDNA MUC1 1 
30 clone, with an N-terminal cysteine residue added for 

conjugation); 
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MUC12: TYRNFTEKMNDASSQEC (SEQ ID NO: 18); corresponding to 
part of the N- glycosylated region, specifically amino acid 
residues 286-302 deduced from the partial cDNA MUC12 
clone, with a C-terminal cysteine residue added for 
conjugation). 

One Balb/c mouse was immunised with each KLH-conjugated 
peptide as per the following protocol:- 

DayO. KLH-conjugated peptide was diluted to 100 ug/mL in 
phosphate buffered saline (PBS) and mixed with an equal 
volume of complete Freund's adjuvant (CFA). Each mouse 
was injected intra-peritonealy with 0.5 ml_ of this mixture. 

Day 14: Each mouse was immunised as above but peptide was mixed 
with incomplete Freund's adjuvant (IFA). 

Day 33: Each mouse was immunised as on day 14. 

Day 43: Each mouse was bled from the tail to assess antibody 
production by ELISA (see below). 

Day 53: Each mouse was injected intra-venously with 1 00 uL of peptide 
at 100 ug/mL in PBS without adjuvant, and with 100 uL mixed 
with IFA intra-peritonealy. 

Day 56: Mice were euthanased, and the spleen removed for fusion with 
myeloma cells. 

Splenocytes were fused to Ag8 mouse myeloma cells at a ratio 
of 5:1 with polyethylene glycol using established methods (Harlow & Lane, 
supra). 

Specific antibody producing clones were screened by a solid 
phase antigen antibody capture ELISA with the immunizing peptides bound 
to polystyrene assay plates using established methods (Harlow & Lane, 
supra). Positive clones were expanded, retested for specific antibody 
production and recloned by limiting dilution. Clones were further tested for 
reactivity with paraffin embedded normal colonic mucosa. 
13,1 MUC1 1 and MUC12 reactive hvbridomas 
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Two hybridomas, one reacting with each of MUC1 1 and MUC12 
peptides and with paraffin embedded colonic sections are described in Table 
2. 

13.2 Immunohistochemical detection of MUC11 and MUC12 in 

normal colonic epithelium using antibodies M1 1.9 and M12. 15 
Paraffin sections (4 urn) of normal colonic epithelium were 
dewaxed with xylene, hydrated in a graded series of ethanol to water and 
treated with 0.1 U/mL neuraminidase (Boehringer, Germany) in 50 mM Na 
acetate, 150 mM NaCI, 100 mM CaCI 2 buffer, pH 5.5 for 1 hr at room 
termperature to remove sialic acid groups. Sections were then treated with 
1% H 2 0 2 , 0.1% NaN 3 in Tris buffered saline (TBS) for 10 min to quench 
endogenous peroxidase activity, and non-specific protein binding blocked 
with 4% skim milk in TBS for 15 min. Monoclonal antibodies M11.9 and 
M12.15 were semi-purified by PEG precipitation and diluted to 5-50 ug/mL 
in TBS/50% non-immune goat serum and incubated for 2 hours overnight at 
room temperature. Sections were washed once with 1 % TX-1 00 in TBS for 
5 min and then twice in TBS for 5 min. Sections were incubated for 30 min 
at room temperature with pre-diluted biotinylated goat anti-mouse 
immunoglobulins (Zymed, USA) and then washed as above. Sections were 
then incubated for 15 min at room temperature with pre-diluted streptavidin- 
conjugated horseradish peroxidase (Zymed Laboratories) and then washed 
as above. Peroxidase activity was detected using 10 mg/mL 0.05% 
diaminobenzidine, 0.03% H 2 0 2 in Tris saline, pH 7.6. Sections were 
counterstained with haematoxylin, dehydrated with ethanol, cleared with 
xylene and mounted in DePeX. 

M11.9 reacts strongly with colonic epithelium, primarily with 
columnar cells of the surface epithelium (see FIG. 9A). Both goblet and 
columnar cells deep in the crypts are not stained by this antibody (see FIG. 
9A). In surface epithelial columnar cells M1 1 .9 reacted with the perinuclear 
cytoplasm, lateral cell membranes and most strongly as granular staining in 
the subapical cytoplasm (FIG. 9B). This localisation suggests reactivity with 
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precursor in the rough endoplasmic reticulum (perinuclear staining), reactivity 
with mature mucin on the lateral membranes at columnar cell junctions with 
other cells, and reactivity with processed mature mucin in granules for apical 
secretion or incorporation into the apical cell membrane. This pattern of 
5 reactivity is distinct from that seen for other known mucin core proteins. 

M12.15 also reacts strongly with colonic epithelium, and like 
M1 1 .9 it reacts primarily with columnar cells of the surface epithelium (see 
FIG. 9C). However, M12.15 gave a more diffuse cytoplasmic staining pattern 
than that seen with M1 1 .9, although, like M1 1 .9, the strongest staining was 

10 in the apical cytoplasm. 

Immunohistochemistry in normal colonic mucosa with these 
antibodies demonstrates protein expression of the MUC11 and MUC12 gene, 
supporting the mRNA studies. The co-expression of MUC1 1 and MUC12 in 
normal colon is also consistent with the RT-PCR data showing similar levels 

15 of relative expression of these two mucin genes in different regions of the 
intestinal tract. 

EXAMPLE 14: Expression of MUC11 bv in situ h ybridization 

14.1 Methods 

Optimisation of conditions for in situ hybridisation, outlined 

20 below, was based upon published techniques (Rex & Scotting, 1994, 
Biochemica 3 24, which is herein incorporated by reference). Riboprobes 
were made by in vitro transcription of DNA with SP6 and T7 RNA 
polymerases and incorporation of a digoxigenin-labelled uridine triphosphate 
(DIG-UTP). The orientation of inserts in pGEM-T was established by 

25 sequencing. Insert in the antisense direction and thus complementary to RNA 
template was the hybridisation probe and insert in the sense direction was 
used as a negative control. 1 mg of purified linearised plasmid pGEM-T was 
labelled in the presence of 1/10 volume 10 x transcription buffer, 1/10 volume 
10 x NTP mix (1 mM ATP, CTP, GTP, 0.65 mM UTP, 0.35 mM DIG-UTP), 

30 10U RNase inhibitor and 40U of either SP6 or T7 RNA polymerase. The 
reaction was carried out at 37°C for 2 hours and terminated by addition of 2 
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ul of 0.2M EDTA. Probes were ethanol precipitated with 1/1 1 volume 4M LiCI 
and placed at -20°C for 2 hours. They were then centrifuged at 12,000 g for 
30 min at 4°C. Pellets were washed with 70% ethanol, air-dried for 10 min 
and resuspended in 100ml of RNase-free water. 

5 Paraffin-embedded junctional tissue specimens were sectioned 

at 4 urn onto sterile water and affixed to Vectabond-treated slides (Vector 
Laboratories). Sections were dewaxed in xylene, rehydrated and then 
incubated for 5 min in 0.2 N HCI. HCI treatment contributes to an 
improvement in the signal to noise ratio by extraction of proteins and partial 

1 0 hydrolysis of target sequences. Slides were washed in sterile water for 5 min, 
followed by 5 min in PBT (PBS and 0.1% Tween 20). Sections were then 
incubated in proteinase K (5 mg/ml) at 37°C for 15 min and washed briefly in 
3 x PBT. They were fixed in 4% paraformaldehyde for exactly 20 min and 
prehybridised for 4 hours at 70°C in hybridisation buffer (50% formamide, 5 

1 5 x SSC, 1 % SDS, 500 mg/mL tRNA, 50 mg/mL heparin). Denatured probe (0.5 
mg/section) was added to hybridation buffer and sections hybridised 

overnight at 70°C. 

Sections were washed in 2 x wash solution 1 (50% formamide, 
5 x SSC, 1% SDS) at 65°C followed by 2 x washes in wash solution 2 (50% 
20 formamide, 2 x SSC) also at 65°C. Sections were then incubated with anti- 
digoxygenin-AP antibody at 1/2000 in PBS overnight at 40°C. 

Excess antibody was removed by 3 x 20 minute washes in PBT. 
Sections were then washed 2 x 20 minute in NTMT buffer (1 00 mM Tris, (pH 
9.5), 50 mM MgCI 2 , 100 mM NaCI, 0.1% Tween 20, 2 mM levamisole). 
25 Hybridisation was visualised with NBT and BCIP overnight at room 
temperature. The reaction was stopped by immersion of slides into 1 x TE 
and sections lightly counterstained in eosin. Sections were then dehydrated 
through ethanols of increasing concentration to xylene and mounted in 
DePeX. Slides were photographed within 3 days due to fading of the signal 

30 with time. 

14? Detection of MUC1 1 mRNA bv in situ hybridization 
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Intense signal for MUC11 was observed in the columnar cells 
of the surface epithelium in all specimens of the normal colon. However, it 
was not possible to conclusively identify positive signal in the goblet cells of 
the colonic epithelium. Transcripts for MUC11 were not detected in adjacent 
5 carcinoma of several junctional tissue specimens (an example is shown in 
FIG. 9D), thus confirming the findings of the differential display and Northern 
blot analyses. 

EXAMPLE 15 : Expression ofMUC11 and MUC12 in normal colon 

bv RT-PCR 

10 The results of RT-PCR experiments to determine the 

expression patterns of MUC11 and MUC12 genes in normal colonic 

epithelium are shown FIG. 10. 

MUC11 and MUC12 are predominantly expressed in the colon, 

although the data in FIG. 10 show that in fact their levels of expression vary 
15 within the colon. In this regard, a progressive increase (3-4 fold) in the 

expression of both MUC1 1 and MUC12 was seen from the right colon to the 

rectum. 

EXAMPLE 16 : Expression of MUC11 and MUC12 in CRC bv RT- 
PCR and Northern hybridization 

20 The expression patterns of MUC1 1 and MUC12 in CRC were 

investigated by RT-PCR, and the results are shown in FIG. 11. After 40 
rounds of amplification, MUC11 expression was observed in all CRC cell 
lines under investigation. Similarly, MUC12 expression was observed in all 
cell lines, although two cell lines, SW620 and SW116 revealed low levels of 

25 expression. 

These observations, together with the downregulation data, 
show that although these genes are downregulated in CRCs, they are still 
detectable in CRC cell lines. In contrast to the normal gastrointestinal tract 
and IBD tissues, the expression of MUC11 and MUC12 in CRCs and in CRC 
30 cell lines show patterns of expression distinct form each other. 

Referring to FIG. 14, the results of Northern analysis with a 
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dd34 (MUC11) probe showed that in nucleic acid extracts obtained from 
colonic tissue of four (4) of the (6) CRC patients tested, the level of MUC1 1 
mRNA expression was lower relative to normal colonic tissue from the same 
patients. Similarly, MUC12 mRNA was downregulated in three (3) of five (5) 
CRC patients (data not shown). 

Such quantitative (e.g. downregulation of these genes and 
differential downregulation expression patterns of MUC11 and MUC12) and 
also qualitative changes of these genes, e.g. mutations, could be used for 
diagnostic and prognostic testing in CRC. 

EXAMPLE 17 : Expression ofMUC11 andMUC12 in IBD by RT-PCR 

The expression patterns of MUC11 and MUC12 in IBD were 
investigated by RT-PCR, and the results are shown in FIG. 12. Cytokeratin 
20, (CK20) a colonic epithelial marker, was employed as a loading control 
due to the variable epithelial content of IBD tissues. 'N' denotes tissues 
which appear macroscopically normal and 'D' refers to tissues reported to 
have IBD. 'CA' refers to the caecum, 'CO' the colon, 'LC the left colon, TC 
the transverse colon, 'RS' the recto-sigmoid colon, 'SI' the small intestine, 
'IL'denotes the ileum and 'IP' an ileal pouch. 

Two patients, patient 1 and patient 4, show 3-4 fold upregulated 
expression of MUC11 and MUC12 in diseased tissues, compared with the 
same intestinal region observed in the 3 normal controls. Patient 6, who has 
a history of severe ulcerative colitis in the right colon, also revealed 
approximately 3-fold upregulated expression of MUC11 and MUC12 
compared to the right colon observed in the normal controls. 

There is coordinate regulation of Mucin expression in the 
normal gastrointestinal tract as well as in IBD tissues and upregulation of 
both Mucin genes was observed in 3/10 patients. Given the documented 
quantitative changes in the expression of MUC11 and MUC12, their 
expression levels may form the basis of useful diagnostic and prognostic 
testing for this disease. Qualitative changes in these genes, eg. mutations 
may also be useful markers for IBD. 
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EXAMPLE 18 : Expression ofMUC11 and MUC12 in BC bv RT-PCR 

The expression patterns of MUC11 and MUC12 in BC tissue 
were investigated by RT-PCR, and the results are shown in FIG. 13. After 40 
rounds of amplification, MUC11 expression was identified in all breast cancer 

5 cell lines under investigation; at low levels in BT-20, DU4475, MDA-MB-435 
and ZR-75-30 cell lines and at higher levels in the remaining nine cell lines. 
Eight of the cell lines showed MUC11 expression higher than the normal 
colonic cDNA positive control. MUC11 is clearly highly expressed by most 
breast cancers and may impact upon the behaviour of the breast cancer 

1 0 cells. MUC1 1 may also be secreted by breast cancers and detection in serum 
could form the basis of diagnostic and prognostic testing for breast cancer. 
MUC12 expression was only readily identifiable in one breast cancer cell 
line, MCF7, although faint bands were observed for BT20, KPL-1 and MA1 1 
cell lines. 

15 EXAMPLE 19 : Experimental Summary 

Differential display has been used to identify two partial cDNAs, 
which encode novel colonic mucin-like proteins. Expression of both cDNAs, 
designated MUC11 and MUC12 by the Human Nomenclature Committee, 
was commonly downregulated in colorectal cancers. 

20 MUC11 and MUC12 were mapped by FISH to chromosome 

band 7q22. The location of another mucin gene, MUC3, at 7q22, suggests 
the identification of a new cluster of mucin genes at this locus. Interestingly, 
four genes encoding gel-forming mucins are found in a cluster on 
chromosome 1 1 and these genes appear to have originated from a common 

25 ancestral gene. While the mucin cDNAs mapped to 7q22 most likely 
represent separate genes, it is also possible that they are produced as a 
result of alternative mRNA splicing from a single, large mucin gene. Northern 
blot analysis for MUCH. MUC12 and MUC3 shows that these encode large 
transcripts, estimated to be greater than 12 kb. 

30 Multiple tissue RNA analysis showed no cross-reactivity 

between MUCH, MUC12 or MUC3. MUC11 and MUC12 showed 
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predominant expression in the colon, while MUC3 was predominantly 
expressed in the small intestine and at very low levels in the colon. This 
expression pattern constitutes an important point of distinction between 
MUC11 and MUC12 genes of the present invention and MUC3. Furthermore, 
the sequences of MUC11 and MUC12 are not homologous with any other 
human mucin genes, but show some degree of similarity within their variable 
tandem repeat regions to each other (71% over 653 bp). However, their clear 
differential expression patterns in normal and tumor tissues as well as tumor 
cell lines, show that they are distinct from each other, and from MUC3. 

While both MUC11 and MUC12 contain variable repeat regions 
typical of mucins, MUC12 is putatively a transmembrane mucin with features 
suggesting an involvement in growth regulation, a largely unrecognized 
function in human mucins. MUC12 is only the fourth human membrane- 
anchored epithelial mucin to be described to date, along with MUC1 , MUC3 
and MUC4. MUC1 has been shown to be involved in cell signaling via 
multiple tyrosine phosphorylation sites on its highly conserved cytoplasmic 
tail (Zrihan-Licht et a/., 1994, FEBS Lett. 356 130). At its carboxyl terminus, 
MUC12 possesses a cytoplasmic tail containing a YNNF sequence (amino 
acids 557-560 in FIG. 2) which is similar to motifs recognized by SH2 
domain-containing proteins (Songyang et a/., Mol. Cell. Biol. 14 2777), 
suggesting that MUC12, like MUC1, may be involved in signal transduction. 

The deduced amino acid sequence of the partial MUC11 cDNA 
was composed entirely of serine/threonine-rich tandem repeats. There is a 
similarity between the tandem repeat consensus sequences of MUC1 1 (FIG. 
4) and MUC12 (FIG. 2) and these also show limited homology to the MUC3 
repeat (ITTETTSHSTPSFTSS). These similarities are consistent with 
evolution from a common ancestral gene. MUCH is more widely expressed 
than MUC12 and MUC3 however, with RNA detected in gastrointestinal, 
respiratory, reproductive and urinary tracts, and unexpectedly in the liver and 
thymus. 

The physiological roles of MUCH and MUC12 in colonic 
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epithelium are unknown. MUC11 and MUC12 are commonly downregulated 
in colorectal cancer suggesting they may play a role in epithelial cell growth 
modulation and/or differentiation. At present, it is not possible to comment on 
whether downregulation of these genes is related to stage of tumor 
5 progression, as only 20 patients were analyzed in this study. However, 
downregulation appears to be so frequent, that it may be an early event in 
tumorigenesis. Given the co-localization of the MUC11 and MUC12 genes 
on chromosome 7q22, it is possible that their expression is co-ordinately 
regulated and hence they are simultaneously downregulated in a large 
10 proportion of colorectal cancers. The effect of downregulation of these 
mucins on normal colonic epithelial cells could be substantial. Mucins are 
believed to protect epithelial cells from attack by pathogenic organisms and 
from mechanical and chemical damage. Therefore, reduced expression of 
these mucins could expose colonic epithelial cells to the harsh environment 
1 5 of the intestinal lumen. Furthermore, loss of a transmembrane mucin such as 
MUC12 may also contribute to loss of critical cell signaling. 

The location of these two novel mucin genes on chromosome 
7q22 may have significance for two non-malignant epithelial diseases where 
aberrant mucin expression and/or function is a recognized component of 
20 pathology, namely, inflammatory bowel disease and cystic fibrosis. 
Susceptibility genes for inflammatory bowel disease have been located to 
chromosomes 3, 12 and 7q22 (Satsangi et al., 1996, Nature Genet. 14 199). 
Thus, MUC11 and MUC12 must be considered candidates for involvement 
in inflammatory bowel disease given their chromosomal localization, 
25 expression in normal colon, and the documented alterations in mucins in this 
disease (Rhodes, 1997, QJM 90 79). Mucins may also play a role in cystic 
fibrosis as patients with the same CFTR gene mutation do not demonstrate 
exactly the same phenotype in terms of mucus obstruction. The existence of 
modifier genes has been postulated and mucin genes are obvious 
30 candidates (Harris & Reid, 1997, J. Med. Genet. 35 82). A murine Mucin 
qene that shows C-terminal homology with MUC12 has recently been shown 
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to be a major constituent of obstructive mucus in the gastrointestinal tract of 
mice with CF (Parmley et al., 1998, J. Clin. Invest 102 1798). 

The CFTR gene lies in the adjacent chromosome band (7q31 ) 
to the MUC3, MUC11 and MUC12 genes. While the significance of these 
5 findings is not clear, MUC1 1 and MUC12, which are expressed in many of 
the tissues affected by cystic fibrosis, should be considered as candidate 
modifier genes involved in the aetiology of this disease. 

Mucins are encoded by large genes which have proved difficult 
to clone by conventional methods due to the repetitive nature of their tandem 
10 repeat regions. Hereinbefore, the present inventors have unexpectedly 
identified by differential display two partial cDNAs which represent novel 
mucin genes that are predominantly expressed in colonic epithelium, both of 
which are downregulated in colorectal cancer. In this regard, MUC11 and 
MUC12 differ from the other mucin gene located on chromosome 7q22, 
1 5 MUC3. These findings together with the sequence homology between the 
MUC12 EGF-like domain and EGF receptor-binding growth factors, suggest 
MUC1 1 and MUC12 may function as growth regulators in colonic epithelium. 
Downregulation of these two novel mucin genes could be an important and 
previously unrecognized step in colorectal carcinogenesis. 
20 Those skilled in the art will appreciate that the invention 

described herein is susceptible to variations and modifications other than 
those specifically described. It is to be understood that the invention 
includes all such variations and modifications. The invention also includes 
all of the steps, features, compositions and compounds referred to or 
25 indicated in this specification, individually or collectively, and any and all 
combinations of any two or more of said steps or features. 
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CODE 


Nmorn 


Nmphe 


Nmpro 


Nmser 


Nmthr 


Nmtrp 


Nmtyr 


Nmval 


Nmetg 


Nmtbug 


NIe 


Nva 


Maib 


Mgabu 


Mchexa 


Mcpen 


NON-CONVENTIONAL 
AMINO ACID 


L-N-methylornithine 


L-N-methylphenylalanine 


L-N-methylproline 


L-N-methylserine 


L-N-methylthreonine 


L-N-methyltryptophan 


L-N-methyltyrosine 


L-N-methylvaline j 


L-N-methylethylglycine 


L-N-methyl-t-butylglycine 


L-norleucine 


L-norvaline 


a-methyl-aminoisobutyrate 


a-methyl-Y-aminobutyrate 


a-methylcyclohexylalanine 


a-methylcylcopentylalanine 


CODE 


Dglu 


Dhis 


Dile 


DIeu 


Dlys 


Dmet 


Dorn 


Dphe 


Dpro 


Dser 


Dthr 


Dtrp 


Dtyr 


Dval 


Dmala 


Dmarg 


NON-CONVENTIONAL 
AMINO ACID 


D-glutamic acid 


D-histidine 


D-isoleucine 


D-leucine 


D-lysine 


D-methionine 


D-ornithine 


D-phenylalanine 


D-proline 


D-serine 


D-threonine 


D-tryptophan 


D-tyrosine 


D-valine 


D-a-methylalanine 


D-a-methylarginine 
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CODE 


Manap 


Mpen 


Nglu 


Naeg 


Norn 


Nmaabu 


Anap 


Nphe 


Ngln 


Nasn 


Nglu 


Nasp 


Ncbut 


Nchep 


Nchex 


Ncdec 


NON-CONVENTIONAL 
AMINO ACID 


a-methyl-a-napthylalanine 


a-methylpenicillamine 


N-(4-aminobutyl)glycine 


N-(2-aminoethyl)glycine 


N-(3-aminopropyl)glycine 


N-amino-a-methylbutyrate 


a-napthylalanine 


N-benzylglycine 


N-(2-carbamylethyl)glycine 


N-(carbamylmethyl)glycine 


N-(2-carboxyethyl)glycine 


N-(carboxymethyl)glycine 


N-cyclobutylglycine 


N-cycloheptylglycine 


N-cyclohexylglycine 


N-cycIodecylglycine 


CODE 


Dmasn 


Dmasp 


Dmcys 


Dmgln 


Dmhis 


Dmile 


Dmleu 


Dmiys 


Dmmet 


Dmom 


Dmphe 


Dmpro 


Dmser 


Dmthr 


Dmtrp 


Dmty 


NON-CONVENTIONAL 
AMINO ACID 


D-a-methylasparagine 


D-a-methylaspartate 


D-a-methylcysteine 


D-a-methylglutamine 


D-a-methylhistidine 


D-a-methylisoleucine 


(D 

c 

*o 

0) 

H 
-«-» 

d) 
E 

I 

Q 


D-a-methyllysine 


D-a-methylmethionine 


D-a-methylornithine 


D-a-methylphenylalanine 


D-a-methylproline 


D-a-methylserine 


D-a-methylthreonine 


D-a-methyltryptophan 


D-a-methyltyrosine 
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CODE 


Ncdod 


Ncoct 


Ncpro 


Ncund 


Nbhm 


Nbhe 


Narg 


Nthr 


Nser 


Nhis 


Nhtrp 


Nmgabu 


Dnmmet 


Nmcpen 


Dnmphe 


Dnmpro 


NON-CONVENTIONAL 
AMINO ACID 


N-cylcododecylglycine 


N-cyclooctylglycine 


N-cyclopropylglycine 


N-cycloundecylglycine 


N-(2,2-diphenylethyl)glycine 


N-(3,3-diphenylpropyl)glycine 


N-(3-guanidinopropyl)glycine 


N-(1-hydroxyethyl)giycine 


N-(hydroxyethyl))glycine 


N-(imidazolylethyl))glycine 


N-(3-indolylyethyl)glycine 


0) 

i_ 
>» 

' 

=3 
_Q 
O 

c 

"e 

CO 

t 

>- 

Q) 

E 
i 

z 


D-N-methylmethionine 


N-methylcyclopentylalanine 


D-N-methylphenylalanine 


D-N-methylproline 


v CODE 


Dmval 


Dnmala 


Dnmarg 


Dnmasn 


Dnmasp 


Dnmcys 


Dnmgin 


Dnmglu 


Dnmhis 


Dnmile 


Dnmleu 


Dnmlys 


Nmchexa 


Dnmorn 


Nala 


Nmaib 


< 

z 

Z < r 

UJ ••• 
> O 
z z 
O s 

9< 

z 
o 

z • 


D-a-methylvaline 


D-N-methylalanine 


D-N-methylarginine 


D-N-methylasparagine 


D-N-methylaspartate 


D-N-methylcysteine 


D-N-methylglutamine 


D-N-methylglutamate 


D-N-methylhistidine 


D-N-methylisoleucine 


D-N-methylleucine 


D-N-methyllysine 


N-methylcyclohexylalanine 


D-N-methylomithine 


N-methylglycine 


N-methylaminoisobutyrate 
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CODE 


Dnmser 


Dnmthr 


Nval 


Nmanap 


Nmpen 


Nhtyr 


Ncys 


Pen 


Mala 


Masn 


Mtbug 


Metg 


Mglu 


Mhphe 


Nmet 


Mlys 


NON-CONVENTIONAL 
AMINO ACID 


D-N-methylserine 


D-N-methylthreonine 


N-(1-methylethyl)glycine 


N-methyla-napthylalanine 


N-methylpenicillamine 


N-(p-hydroxyphenyl)glycine 


N-(thiomethyl)glycine 


penicillamine 

. — 1 


L-a-methylalanine 


L-a-methylasparagine 


L-a-methyl-f-butylglycine 


L-methylethylglycine 


L-a-methylglutamate 


L-a-methylhomophenylalanine 


N-(2-methylthioethyl)glycine 


<d 
c 
*</) 
_>» 

E 
i 

t 

-J 


CODE 


Nile 


NIeu 


Dnmtrp 


Dnmtyr 


Dnmval 


Gabu 


Tbug 


O) 
LLJ 


Hphe 


Marg 


Masp 


Mcys 


Mgln 


Mhis 


Mile 


MIeu 


NON-CONVENTIONAL 
AMINO ACID 


N-(1 -methylpropyl)glycine 


N-(2-methylpropyl)glycine 


D-N-methyltryptophan 


D-N-methyltyrosine 


D-N-methylvaline 


Y-aminobutyric acid 


L-f-butylglycine 


L-ethylglycine 


L-homophenylalanine 1 


L-a-methylarginine 


L-a-methylaspartate 


L-a-methylcysteine 


L-a-methylglutamine 


L-a-methylhistidine 


L-a-methylisoleucine 


0) 

c 

Q) 
-C 

-•— » 

a> 
E 

1 

? 

-J 
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CODE /* 


Mnle 


Morn 


Mpro 

■ 


Mthr 


Mtyr 


Nmhphe 


Nnbhe 


















ne 






ethylamino)cyclopropane 


NON-CONVENTIONAL 
AMINO ACID 


L-a-methylnorleucine 


L-a-methylornithine 


L-a-methylproline 


L-a-methylthreonine 


L-a-methyltyrosine 


L-N-methylhomophenylalani 


N-(N-(3,3-diphenylpropyi; 


CD 

c 

'o 

O) 
CD 

E 
E 

03 

XJ 
i— 

CO 
O 


CODE 


Mmet 


Mnva 


Mphe 


Mser 


Mtrp 


Mval ' 


Nnbhm 




Nmbc 


NON-CONVENTIONAL 
■ AMINO ACID 


L-a-methylmethionine 


L-a-methylnorvaline 


CD 

c 

*c 

03 

03 

>* 
C 
CD 
SZ 
CL 

CD 

E 

I 

O 
t 

_J 


L-a-methylserine 


L-a-methyltryptophan 


L-a-methylvaline 


N-(N-(2 ( 2-diphenylethyl) 


carbamylmethyOglycine 


1 -carboxy-1 -(2,2-diphenyl- 
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TABLE 2 



Hybridoma 


Immunising 
peptide 


Isotype 


Peptide ; 
^activity i 


j^nmuno- 
: histochemicai i 
reactivity with 
; fcolon 


M11.9 


MUC11 


igM 


++++ 


Reacts with paraffin 
embedded tissue, 

rpartivitv pnhannsd 

icdwiiviiy ci ii icii iocu 

by pre-treatment of 
sections with 
neuraminidase 
which removes 

sialic acid groups. 


M12.15 


MUC12 


igM 


++++ 


Reacts with paraffin 
embedded tissue, 
reactivity enhanced 
by pre-treatment of 
sections with 
neuraminidase 
which removes 
sialic acid groups. 
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CLAIMS 

1 . An isolated MUC nucleic acid corresponding to a MUC gene 
located on human chromosome 7q22, or a mammalian chromosome 
structurally or functionally equivalent thereto, which MUC gene is normally 
predominantly expressed in the colon. 

2. The isolated MUC nucleic acid of Claim 1 which comprises a 
nucleotide sequence encoding an amino acid sequence which comprises 
SGLSEESTTSHSSPGSTHTTLSPASTTT(SEQ ID NO :1). 

3. The isolated MUC nucleic acid of Claim 2, wherein the nucleic 
acid comprises a nucleotide sequence encoding an amino acid sequence 
according to SEQ ID NO:3. 

4 The isolated MUC nucleic acid of Claim 2, wherein the nucleic 
acid includes a sequence of nucleotides according to SEQ ID NO: 2. 

5 The isolated MUC nucleic acid of Claim 1 , wherein the nucleic 
acid which comprises a nucleotide sequence which encodes an amino acid 
sequence whuch comprises SGLSQESTTFHSSPGSTETTLAPASTTT (SEQ 
ID NO: 4). 

6. The isolated MUC nucleic acid of Claim 5, wherein the nucleic 

acid comprises a nucleotide sequence which encodes an amino acid 
sequence according to SEQ ID NO:6. 

7 The isolated MUC nucleic acid of Claim 5, wherein the nucleic 
acid includes a sequence of nucleotides according to SEQ ID NO: 5. 

8 a MUC nucleic acid homolog which hybridizes to the isolated 
MUC nucleic acid of any one of Claims 2-7 under conditions of at least low 
stringency. 

g a MUC nucleic acid homolog which has at least 60% 

nucleotide sequence identity with the isolated MUC nucleic acid of any one 
of Claims 2-7. 

1 0. An isolated MUC polypeptide having an amino acid sequence 

according to SEQ ID NO: 3. 

-! 1 An isolated MUC polypeptide having an amino acid sequence 
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according to SEQ ID NO: 6. 

12. An isolated MUC polypeptide homolog which has at least 60% 
amino acid identity with the MUC polypeptide of Claim 10 or Claim 1 1 . 

13. An antibody specific for the MUC polypeptide or MUC 
polypeptide homolog of any one of Claims 10-12. 

14. An antibody according to Claim 13 which is a monoclonal 
antibody. 

15 a monoclonal antibody according to Claim 14, which 

monoclonal antibody is selected from the group consisting of M1 1 .9 and 
M12.15. 

16. A method of detecting the MUC polypeptide of Claim 10 or 
Claim 1 1 , including the steps of> 

(i) obtaining a sample from said mammal; 

(ii) forming a complex between said MUC polypeptide, if 
present in said sample, and an anti-MUC polypeptide 
antibody; and 

(iii) detecting said MUC polypeptide in said complex. 

1 7. The method of Claim 1 7, wherein the antibody is selected from 
the group consisting of M11.9 and M12.15. 

1 8. A method of detecting a MUC gene or a MUC gene transcript 

including the steps of> 

(i) obtaining a nucleic acid extract from said mammal; 

(ii) forming a hybrid nucleic acid comprising a MUC gene 
or a MUC gene transcript if present in said sample, and 
a corresponding isolated MUC nucleic acid according 
to Claim 1 , or a portion thereof; and 

(iii) detecting said hybrid nucleic acid. 

19. The method of Claim 18, wherein the isolated MUC nucleic 
acid has a nucleotide sequence selected from the group consisting of SEQ 
ID NO:2 and SEQ ID NO:5. 

20. A method of detecting a MUC gene or a MUC gene transcript 
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including the steps of:- 

(i) obtaining a nucleic acid extract from said mammal; 

(ii) using one or more primers, each having a nucleotide 
sequence corresponding to a distinct portion of the 
isolated MUC nucleic acid of Claim 1 , together with a 
polynucleotide sequence amplification technique, to 
produce a MUC gene amplification product from said 
extract; and 

(iii) detecting said MUC gene amplification product. 

21 . The method of Claim 20, wherein the one or more primers is 
selected from the group consisting of SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID 
NO: 10, SEQ ID NO: 11, SEQ ID NO: 14 and SEQ ID NO:15. 

22. The method of Claim 21 , wherein the polynucleotide sequence 
amplification technique is RT-PCR. 

23. Use of the isolated MUC nucleic acid of Claim 1 , or a portion 
thereof, to detect a polymorphism, mutation, deletion, truncation and/or 
expansion in a corresponding MUC gene or MUC gene transcript. 

24. A pharmaceutical composition comprising a pharmaceutical^ 
acceptable amount of the MUC polypeptide of Claim 10 or Claim 11, 
together with a pharmaceutically-acceptable carrier and/or diluent. 

25. A pharmaceutical composition comprising a pharmaceutical^ 
acceptable amount of an anti-MUC antibody according to any one of Claims 
13-15, together with a pharmaceutically-acceptable carrier and/or diluent. 

26. A method of treating of a mammal suffering from a disease 
condition, said method including the step of administering to said mammal 
a pharmaceutical composition according to Claim 24 or Claim 25 to thereby 
alleviate or prevent one or more symptoms of said disease condition in said 
mammal. 

27. A method of gene therapy of a mammal suffering from a 
disease condition, said method including the step of administering a gene 
therapy construct to said mammal, said gene therapy construct comprising 
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the isolated MUC nucleic acid of Claim 1, or a portion thereof, to thereby 
alleviate or prevent one or more symptoms of said disease condition in said 
mammal. 

28. The method of Claim 27, wherein the isolated MUC nucleic 
acid has a nucleotide sequence selected from the group consisting of SEQ 
ID NO:2 and SEQ ID NO:5. 

29. The method of any one of Claims 26-28, wherein said disease 
condition is associated with aberrant Mucin expression, altered properties 
of mucus or epithelial inflammatory processes involving Mucins 

30. The method of any one of Claims 26-28, wherein said disease 
condition is selected from the group consisting of colorectal cancer (CRC), 
cystic fibrosis (CF), inflammatory bowel disease (IBD), breast cancer (BC), 
Crohn's disease, ulcerative colitis, asthma and chronic bronchitis. 

31 . The method of Claim 30, wherein said disease condition is 
selected from the group consisting of colorectal cancer (CRC), cystic fibrosis 
(CF), inflammatory bowel disease (IBD) and breast cancer (BC). 

32. The method of any one of Claims 26-31 , wherein the mammal 
is a human. 
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Consensus tandem repeat sequence 

SGLSQESTTPHS3PGSTETTLSPA3TTT 

H S 

TLSPASMRS 9 
SS I SGEPTS LYSQAESTHTTAFPASTTT 37 
SGLSQESTTFHSKPGSTETTLSPGSITT 65 
SSFAQEFTTPHSQPGSALSTVS PASTTV 93 
PGLSEESTTFYSSPGSTETTAFSHSNTM^i 
SIHSQQSTPFPDSPGFTHTVL- PATLTT143 
TDIGQESTAFHSSSDATGTTPLPARSTA 176 
SDLVGEPTTFYISPSPTYTTLFPASSST204 
SGLTEESTTFHTSPSFTSTIVSTESLET232 

lapgl coegoiwngkocvcpogyvgyqc ^n 
lsplesfpvetpekl|atlgmtvkvtyr 288 
|ftekmndassqeyq|fstlfknrmdvv 316 
lkgdnlpqyrgvnirrll|gs ivvkndv 344 

I LEADYTLEYEELFENLAEIVKAKIM|e 372 
TRTTLLDPDSCRKAILCYSEEDTFVDSS 400 
VTPGFDFQEQ CTQKAAEGYTQFYYVDVL 428 
DGKLACVNKCTKGTKSQMNCNLGTCQLQ ^ 
RSGPRCLCPNTNTHWYWGETC EFNIAKS 484 
LVYG I VGAVMAVLLLAL I I LI I LFSL S Q 512 
RKRHREQYDVPQEWRKEGTPGI FQKTAI 540 
WEDQNLRESRFGLENAYNNFRPTLETVD 558 
SGTELHIQRPEMVASTV * 



FIG. 2 
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Consensus tandem repeat sequence 

SGLSEESTTSHSSPGSTHTTLSPASTTT 
F 

RNRPHTTAFPGSTTM 15 

PGVSQESTASHSSPGSTDTTLSPGSTTA 43 

SSLGPESTTFHSGPGSTETTLLPD&TTA 73. 

SGLLEASTPVHSSTGSPHTTLSPAGSTT 99 

--RQGESTTFQSWPNSKDTTPAP-PTTT^ 
SAFVEI.STTSHGSPSSTPTTHPSASSTT1S2 
LGRSEESTTVHSSPVATATTPSPARSTTxbo 
SGLVEESTTYHSSPGSTQTMHFPESDTT208 
SGRGEESTTSHSS--TTHTI-SSAPSTT233 
SALVEEPTSYHSSPGSTATTHFPDSSTT^ 

SGRSEESTASHSNQDATGTIVLPARSTT289 
SVLLGESTTSPISSGSMETTALPGSTTT^ 
PGLSEKSTTFHSSPRSPATTLSPASTTS 345 
SGVSEESTTSHSRPGSTHTTAFPDSTTT 373 
PGLSRHSTTSHSSPGSTDTTLLPASTTT 401 
SGPSQESTTSHSSPGSTDTALSPGSTTA429 
LSFGQESTTFHSSPGSTHTTLFPDSTTS 457 

SGIVEASTRVHSSTGSPRTTLSPASSTS485 
PGLQGESTAFQTHPASTHTTPSTP STAT 5J3 
-APVEESTTYHRSPSSTPTTHFPASSTT54Q 
SGHSEKSTIFHSSPDASGTTPSSAHSTTsgs 
SGRG-ESTTSRISPGSTEITTLPGSTTT 595 

PGLSEASTTFYSSPRSPTTTLSPASMTS623 
LGVGEESTTSRSQPGSTHSTVSPASTTTgg. 
PGLSEESTTVYSSSPGSTETTVFPRSTTfiTg 
TSVRGEEPTTFHSRPASTHTTLFTEDST 707 
TS GliTEESTAFPGSPASTQTGLPATLTT 735 
ADLGEESTTFPSSSGSTGTTLSPARSTT70 
SGLVGESTPSRLSPSSTETTTLPGSPTTTg! 
PSLSEKSTTFYTSPRSPDATLSPATTTS gl9 
SGVSEESSTSHSQPGSTHTTAFPDSTTT^ 
SGLSQEPKTSHSSQGSTEATLSPGSTTAbts 
SSLGQQSTTFHSSPGDTETTLLPDDTITgm 
SGLVEASTPTHSSTGSLHTTLTPASSTS 931 
AGLQEESTTFQSWPSSSDTTPSPPGP 957 



FIG. 4 



WO 00/04142 PCT/AU99/00579 

6/15 
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differential display 

screening 
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stringent 
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cl2 - 1043bp 



0 
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PCR screening of a 
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SEQUENCE LISTING 

<110> The Council of the Queensland Institute of Medical 
The Corporation of the Trustees of the Order of th 

<120> MUCINS 

<130> mucinqimr 

<140> 
<141> 

<150> PP4708 
<151> 1998-07-16 

<160> 18 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> REPEAT 
<222> (1) . . (28) 

<223> MUCH consensus tandem repeat sequence 



<400> 1 

Ser Gly Leu Ser Glu Glu Ser Thr Thr Ser His Ser Ser Pro Gly Ser 
1 5 



10 15 



Thr His Thr Thr Leu Ser Pro Ala Ser Thr Thr Thr 
20 25 



<210> 2 

<211> 2872 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) - • (2871) 



<400> 2 
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Arg Asn Arg Pro His Thr Thr Ala Phe Pro Gly Ser Thr Thr Met Pro 
15 10 15 

ggc gtc agt cag gaa tct aca get tec cac age age cca ggc tec aca 
Gly Val Ser Gin Glu Ser Thr Ala Ser His Ser Ser Pro Gly Ser Thr 
20 25 30 

gac aca aca ctg tec cct ggc agt acc aca gca tea tec ctt ggt cca 
Asp Thr Thr Leu Ser Pro Gly Ser Thr Thr Ala Ser Ser Leu Gly Pro 
35 40 45 

gaa tct act acc ttc cac age ggc cca ggc tec act gaa aca aca etc 
Glu Ser Thr Thr Phe His Ser Gly Pro Gly Ser Thr Glu Thr Thr Leu 
50 55 60 

tta cct gac aac acc aca gec tec ggc etc ctt gaa gca tct acg ccc 
Leu Pro Asp Asn Thr Thr Ala Ser Gly Leu Leu Glu Ala Ser Thr Pro 
65 70 75 80 

gtc cac age age act gga teg cca cac aca aca ctg tec cct gec ggc 288 
Val His Ser Ser Thr Gly Ser Pro His Thr Thr Leu Ser Pro Ala Gly 
85 9° 95 

tct aca acc cgt cag gga gaa tct acc acc ttc cag age tgg cct aac 336 
Ser Thr Thr Arg Gin Gly Glu Ser Thr Thr Phe Gin Ser Trp Pro Asn 
100 105 110 

teg aag gac act acc cct gca cct cct act ace aca tea gec ttt gtt 384 
Ser Lys Asp Thr Thr Pro Ala Pro Pro Thr Thr Thr Ser Ala Phe Val 
115 120 125 



gag eta tct aca acc tec cac ggc age ceg age tea act cca aca acc 
Glu Leu Ser Thr Thr Ser His Gly Ser Pro Ser Ser Thr Pro Thr Thr 
130 135 14° 

cac ttt tct gec age tec aca acc ttg ggc cgt agt gag gaa teg aca 
His Phe Ser Ala Ser Ser Thr Thr Leu Gly Arg Ser Glu Glu Ser Thr 
145 150 155 I 60 

aca gtc cac age age cca gtt gca act gca aca aca ccc teg cct gec 
Thr Val His Ser Ser Pro Val Ala Thr Ala Thr Thr Pro Ser Pro Ala 
165 170 1" 

cgc tee aca acc tea ggc etc gtt gaa gaa tct acg acc tac cac age 
Arg ser Thr Thr Ser Gly Leu Val Glu Glu Ser Thr Thr Tyr His Ser 
180 185 I 90 



96 



144 



192 



240 



432 



480 



528 



576 
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Ser Pro Gly Ser Thr Gin Thr Met His Phe Pro Glu Ser Asp Thr Thr 
195 200 205 

tea ggc cgt ggt gaa gaa tea aca act tec cac age age aca aca cac 
Ser Gly Arg Gly Glu Glu Ser Thr Thr Ser His Ser Ser Thr Thr His 
210 215 220 

aca ata tct tea get cct age ace aca tct gec ctt gtt gaa gaa cct 
Thr lie Ser Ser Ala Pro Ser Thr Thr Ser Ala Leu Val Glu Glu Pro 
225 230 235 240 

acc age tac cac age age ccg ggc tea act gca aca aca cac ttc cct 
Thr Ser Tyr His Ser Ser Pro Gly Ser Thr Ala Thr Thr His Phe Pro 
245 250 255 

gac age tec aca acc tea ggc cgt agt gag gaa tea aca gca tec cac 
Asp Ser Ser Thr Thr Ser Gly Arg Ser Glu Glu Ser Thr Ala Ser His 
260 265 270 

age aac caa gac gca acg gga aca ata gtc eta cct gee cgc tec aca 
Ser Asn Gin Asp Ala Thr Gly Thr He Val Leu Pro Ala Arg Ser Thr 
275 280 285 

acc tea gtt ctt ctt gga gaa tct acg acc tea ccc ate agt tea ggc 
Thr Ser Val Leu Leu Gly Glu Ser Thr Thr Ser Pro He Ser Ser Gly 
290 295 300 



tea atg gaa acg aca gcg tta ccc ggc agt acc aca acg cca ggc etc 
Ser Met Glu Thr Thr Ala Leu Pro Gly Ser Thr Thr Thr Pro Gly Leu 
305 



310 315 320 



345 350 



acc acc tec cac age cga cca ggc tea acg cac aca aca gca ttc cct 
Thr Thr Ser His Ser Arg Pro Gly Ser Thr His Thr Thr Ala Phe Pro 
355 360 365 

gac age acc acc acg cca ggc etc agt egg cat tct aca act tec cac 
Asp Ser Thr Thr Thr Pro Gly Leu Ser Arg His Ser Thr Thr Ser His 
370 375 380 



672 



720 



768 



816 



864 



912 



960 



1008 



agt gag aaa tct acc act ttc cac agt age ccc aga tea cca gec aca 
ser Glu Lys Ser Thr Thr Phe His Ser Ser Pro Arg Ser Pro Ala Thr 
325 330 335 

aca etc tea cct gee age acg aca age tea gge gtc agt gaa gaa tec 1056 
Thr Leu Ser Pro Ala Ser Thr Thr Ser Ser Gly Val Ser Glu Glu Ser 
340 



1104 



1152 



1200 
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Ser Ser Pro Gly Ser Thr Asp Thr Thr Leu Leu Pro Ala Ser Thr Thr 
385 390 395 400 

acc tea ggc ccc agt cag gaa tea aca act tec cac age age cca ggt 
Thr Ser Gly Pro Ser Gin Glu Ser Thr Thr Ser His Ser Ser Pro Gly 
405 410 415 

tea act gac aca gca ctg tec cct ggc agt acc aca gee tta tec ttt 
Ser Thr Asp Thr Ala Leu Ser Pro Gly Ser Thr Thr Ala Leu Ser Phe 
420 425 430 

ggt caa gaa tct aca acc ttc cac age age cca ggc tec act cac aca 
Gly Gin Glu Ser Thr Thr Phe His Ser Ser Pro Gly Ser Thr His Thr 
435 440 445 

aca etc ttc cct gac age acc aca age tea ggc ate gtt gaa gca tet 
Thr Leu Phe Pro Asp Ser Thr Thr Ser Ser Gly He Val Glu Ala Ser 
450 455 460 

aca cgc gtc cac age age act ggc tea cca egc aca aca ctg tec cct 
Thr Arg Val His Ser Ser Thr Gly Ser Pro Arg Thr Thr Leu Ser Pro 
465 470 475 

gec age tee aca age cct gga ctt cag gga gaa tct ace gee ttc cag 
Ala Ser Ser Thr Ser Pro Gly Leu Gin Gly Glu Ser Thr Ala Phe Gin 
485 490 495 

acc cac cca gee tea act cac acg aeg cct tea act cct age acc gca 
Thr His Pro Ala Ser Thr His Thr Thr Pro Ser Thr Pro Ser Thr Ala 



500 



505 510 



aca gee cct gtt gaa gaa tct aca ace tac cac cgc age cca age teg 

Thr Ala Pro Val Glu Glu Ser Thr Thr Tyr His Arg Ser Pro Ser Ser 
515 520 525 

act cca aca aca cac ttc cct gec age tee aca act teg ggc cac agt 

Thr Pro Thr Thr His Phe Pro Ala Ser Ser Thr Thr Ser Gly His Ser 



530 



535 540 



gag aaa tea aca ata ttc cac age age cca gat gca agt gga aca aca 
Glu Lys Ser Thr lie Phe His Ser Ser Pro Asp Ala Ser Gly Thr Thr 
545 

ccc tea tct gee cac tee aca acc tea ggt cgt gga gaa tct aca acc 
Pro ser Ser Ala His Ser Thr Thr Ser Gly Arg Gly Glu Ser Thr Thr 
565 570 575 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 
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ser Arg lie Ser Pro Gly Ser Thr Glu He Thr Thr Leu Pro Gly Ser 
580 585 590 

acc aca aca cca ggc etc agt gag gca tct acc acc ttc tac age age 1824 
Thr Thr Thr Pro Gly Leu Ser Glu Ala Ser Thr Thr Phe Tyr Ser Ser 
595 600 605 



ccc aga tea cca acc aca aca etc tea cct gec age atg aca age eta 
Pro Arg Ser Pro Thr Thr Thr Leu Ser Pro Ala Ser Met Thr Ser Leu 
610 615 620 

ggc gtc ggt gaa gaa tec acc acc tec cgt age caa cca ggt tct act 
Gly Val Gly Glu Glu Ser Thr Thr Ser Arg Ser Gin Pro Gly Ser Thr 
625 630 635 640 

cac tea aca gtg tea cct gec age acc acc acg cca ggc etc agt gag 
His Ser Thr Val Ser Pro Ala Ser Thr Thr Thr Pro Gly Leu Ser Glu 
645 650 655 

gaa tct acc acc gtc tac age age age cca ggc tea act gaa acc aca 
Glu Ser Thr Thr Val Tyr Ser Ser Ser Pro Gly Ser Thr Glu Thr Thr 
660 665 670 

gtg ttc cct cge age ace aca aee tea gtt cgt ggt gaa gag cct aca 
Val Phe Pro Arg Ser Thr Thr Thr Ser Val Arg Gly Glu Glu Pro Thr 
675 680 685 

acc ttc cac age egg cca gee tea act cac aca aca etg ttc act gag 
Thr Phe His Ser Arg Pro Ala Ser Thr His Thr Thr Leu Phe Thr Glu 
690 695 700 

gac age ace acc teg ggc etc act gaa gaa tct aca gec ttc ccc ggc 
Asp Ser Thr Thr Ser Gly Leu Thr Glu Glu Ser Thr Ala Phe Pro Gly 
705 710 715 720 

age cca gec tec acc caa aca ggg tta cct gec aca etc aca acc gca 
ser Pro Ala Ser Thr Gin Thr Gly Leu Pro Ala Thr Leu Thr Thr Ala 



725 



730 735 



gac etc ggt gag gaa tea act acc ttt ccc age age tea ggc tea act 
Asp Leu Gly Glu Glu Ser Thr Thr Phe Pro Ser Ser Ser Gly Ser Thr 
740 745 750 

gga aca aca etc tea cct gec cge tec acc acc tct ggc etc gtt gga 
Gly Thr Thr Leu Ser Pro Ala Arg Ser Thr Thr Ser Gly Leu Val Gly 
755 760 765 



1872 



1920 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



TIC") 
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Glu Ser Thr Pro Ser Arg Leu Ser Pro Ser Ser Thr Glu Thr Thr Thr 
770 775 780 

tta ccc ggc agt ccc aca aca cca age etc agt gag aaa tea acc acc 
Leu Pro Gly Ser Pro Thr Thr Pro Ser Leu Ser Glu Lys Ser Thr Thr 
785 790 -795 800 

ttc tac act age ccc aga tea cca gat gca aca etc tea cct gca acc 
Phe Tyr Thr Ser Pro Arg Ser Pro Asp Ala Thr Leu Ser Pro Ala Thr 
805 BIO 815 

aca aca age tea ggc gtc age gaa gaa tec age aca tec cac agt caa 
Thr Thr Ser Ser Gly Val Ser Glu Glu Ser Ser Thr Ser His Ser Gin 
820 825 830 

cca ggc tea acg cac aca aca gcg ttc cet gac age acc acc acc tea 
Pro Gly Ser Thr His Thr Thr Ala Phe Pro Asp Ser Thr Thr Thr Ser 
835 840 845 

ggc etc agt cag gaa cct aaa act tec cac age age caa ggc tea aca 
Gly Leu Ser Gin Glu Pro Lys Thr Ser His Ser Ser Gin Gly Ser Thr 
850 855 860 

gag gca aca ctg tec cct ggc agt acc aca gec tea tec ett ggt caa 
Glu Ala Thr Leu Ser Pro Gly Ser Thr Thr Ala Ser Ser Leu Gly Gin 
865 

caa tct aca acc ttc cac age age cca ggc gac act gaa acc aca etc 
Gin Ser Thr Thr Phe His Ser Ser Pro Gly Asp Thr Glu Thr Thr Leu 
885 890 895 

tta cct gat gac acc ata acc tea ggc etc gtg gag gca tct aca ccc 
Leu Pro Asp Asp Thr He Thr Ser Gly Leu Val Glu Ala Ser Thr Pro 
900 905 910 

ace cac age age act ggc teg eta cac aca aca ctg ace cct gec age 
Thr His Ser Ser Thr Gly Ser Leu His Thr Thr Leu Thr Pro Ala Ser 
915 920 925 

tec aca age get ggc ctt eag gaa gaa tct act act ttc cag age tgg 
Ser Thr Ser Ala Gly Leu Gin Glu Glu Ser Thr Thr Phe Gin Ser Trp 
930 935 940 

cca age tea agt gac aca aca cct tea cct ccc ggc ccg g 
Pro Ser Ser Ser Asp Thr Thr Pro Ser Pro Pro Gly Pro 
945 950 955 



2400 



2448 



2496 



2544 



2592 



2640 



2688 



2736 



2784 



2832 



2872 
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<210> 3 
<211> 957 
<212> PRT 

<213> Homo sapiens 



<400> 3 

Arg Asn Arg Pro His Thr Thr Ala Phe Pro Gly Ser Thr Thr Met Pro 
15 10 



15 



Gly Val Ser Gin Glu Ser Thr Ala Ser His Ser Ser Pro Gly Ser Thr 
20 25 30 

Asp Thr Thr Leu Ser Pro Gly Ser Thr Thr Ala Ser Ser Leu Gly Pro 
35 40 45 

Glu Ser Thr Thr Phe His Ser Gly Pro Gly Ser Thr Glu Thr Thr Leu 



50 



55 



60 



Leu Pro Asp Asn Thr Thr Ala Ser Gly Leu Leu Glu Ala Ser Thr Pro 

75 80 



65 



70 



Thr Thr Leu Ser Pro Ala Gly 



95 



Val His Ser Ser Thr Gly Ser Pro Hit 

85 9° 

Ser Thr Thr Arg Gin Gly Glu Ser Thr Thr Phe Gin Ser Trp Pro Asn 
100 105 HO 

Ser Lys Asp Thr Thr Pro Ala Pro Pro Thr Thr Thr Ser Ala Phe Val 



115 



120 



125 



Glu Leu Ser Thr Thr Ser His Gly Ser Pro Ser Ser Thr Pro Thr Thr 



130 



His Phe Ser Ala Se 
145 



135 



140 



r Ser Thr Thr Leu Gly Arg Ser Glu Glu Ser Thr 



150 



155 



160 



Thr Ala Thr Thr Pro Ser Pro Ala 



175 



Thr Val His Ser Ser Pro Val Ala 

165 I 70 

Arg ser Thr Thr Ser Gly Leu Val Glu Glu Ser Thr Thr Tyr His Ser 
180 185 190 

Ser Pro Gly Ser Thr Gin Thr Met His Phe Pro Glu Ser Asp Thr Thr 



195 



200 



205 



Ser Gly Arg Gly Glu Glu Ser Thr Thr Ser His Ser Ser Thr Thr His 
210 215 220 



WO 00/04142 PCT/AU99/00579 



Thr lie Ser Ser Ala Pro Ser Thr Thr Ser Ala Leu Val Glu Glu Pro 
225 230 235 240 

Thr Ser Tyr His Ser Ser Pro Gly Ser Thr Ala Thr Thr His Phe Pro 
245 250 255 

Asp Ser Ser Thr Thr Ser Gly Arg Ser Glu Glu Ser Thr Ala Ser His 
260 265 270 

Ser Asn Gin Asp Ala Thr Gly Thr lie Val Leu Pro Ala Arg Ser Thr 
275 280 285 

Thr Ser Val Leu Leu Gly Glu Ser Thr Thr Ser Pro He Ser Ser Gly 
290 295 300 



Ser 



Met Glu Thr Thr Ala Leu Pro Gly Ser Thr Thr Thr Pro Gly Leu 

305 310 -3 J- 3 

Ser Glu Lys Ser Thr Thr Phe His Ser Ser Pro Arg Ser Pro Ala Thr 

325 330 335 

Thr Leu Ser Pro Ala Ser Thr Thr Ser Ser Gly Val Ser Glu Glu Ser 
340 345 350 



Thr 



Thr Ser His Ser Arg Pro Gly Ser Thr His Thr Thr Ala Phe Pro 



355 



360 365 



Asp Ser Thr Thr Thr Pro Gly Leu Ser Arg His Ser Thr Thr Ser His 



370 



375 



380 



Ser Ser Pro Gly Ser Thr Asp Thr Thr Leu Leu Pro Ala Ser Thr Thr 



385 



390 



395 



400 



Thr Ser Gly Pro Ser Gin Glu Ser Thr Thr Ser His Ser Ser Pro Gly 



405 410 415 



Ser Thr Asp Thr Ala Leu Ser Pro 
420 



Gly Ser Thr Thr Ala Leu Ser Phe 
425 430 



Gly Gin Glu Ser Thr Thr Phe His Ser Ser Pro Gly Ser Thr His Thr 
435 440 445 

Thr Leu Phe Pro Asp Ser Thr Thr Ser Ser Gly He Val Glu Ala Ser 
450 455 460 



Thr Gly Ser Pro Arg Thr Thr Leu Ser Pro 

465 470 



Thr Arg Val His Ser Ser 

475 480 



WO 00/04142 PCT/AU99/00579 



Ala Ser Ser Thr Ser Pro Gly Leu Gin Gly Glu Ser Thr Ala Phe Gin 
485 490 495 

Thr His Pro Ala Ser Thr His Thr Thr Pro Ser Thr Pro Ser Thr Ala 
500 505 510 

Thr Ala Pro Val Glu Glu Ser Thr Thr Tyr His Arg Ser Pro Ser Ser 
515 520 525 

Thr Pro Thr Thr His Phe Pro Ala Ser Ser Thr Thr Ser Gly His Ser 
530 535 540 

Glu Lys Ser Thr He Phe His Ser Ser Pro Asp Ala Ser Gly Thr Thr 
545 550 555 560 

Pro Ser Ser Ala His Ser Thr Thr Ser Gly Arg Gly Glu Ser Thr Thr 
565 570 575 

ser Arg He Ser Pro Gly Ser Thr Glu He Thr Thr Leu Pro Gly Ser 
580 585 590 

Thr Thr Thr Pro Gly Leu Ser Glu Ala Ser Thr Thr Phe Tyr Ser Ser 
595 600 605 

Pro Arg Ser Pro Thr Thr Thr Leu Ser Pro Ala Ser Met Thr Ser Leu 
610 615 620 



Gly Val Gly Glu Glu Ser Thr Thr Ser Arg Ser Gin Pro Gly Ser Thr 
625 

His Ser Thr Val Ser Pro Ala Ser Thr Thr Thr Pro Gly Leu Ser Glu 
645 650 655 

Glu Ser Thr Thr Val Tyr Ser Ser Ser Pro Gly Ser Thr Glu Thr Thr 
660 665 670 

Val Phe Pro Arg Ser Thr Thr Thr Ser Val Arg Gly Glu Glu Pro Thr 



675 



680 685 



Thr Phe His Ser Arg Pro Ala Ser Thr His Thr Thr Leu Phe Thr Glu 
690 



695 700 



Asp Ser Thr Thr Ser Gly Leu Thr Glu Glu Ser Thr Ala Phe Pro Gly 
705 710 715 720 

Ser Pro Ala Ser Thr Gin Thr Gly Leu Pro Ala Thr Leu Thr Thr Ala 
725 730 735 
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Asp Leu Gly Glu Glu Ser Thr Thr Phe Pro Ser Ser Ser Gly Ser Thr 
740 745 750 

Gly Thr Thr Leu Ser Pro Ala Arg Ser Thr Thr Ser Gly Leu Val Gly 
755 760 765 

Glu Ser Thr Pro Ser Arg Leu Ser Pro Ser Ser Thr Glu Thr Thr Thr 
770 775 780 

Leu Pro Gly Ser Pro Thr Thr Pro Ser Leu Ser Glu Lys Ser Thr Thr 
785 790 795 800 

Phe Tyr Thr Ser Pro Arg Ser Pro Asp Ala Thr Leu Ser Pro Ala Thr 
805 810 815 

Thr Thr Ser Ser Gly Val Ser Glu Glu Ser Ser Thr Ser His Ser Gin 
820 825 830 

Pro Gly Ser Thr His Thr Thr Ala Phe Pro Asp Ser Thr Thr Thr Ser 
835 840 845 

Gly Leu Ser Gin Glu Pro Lys Thr Ser His Ser Ser Gin Gly Ser Thr 
850 855 860 

Glu Ala Thr Leu Ser Pro Gly Ser Thr Thr Ala Ser Ser Leu Gly Gin 
865 870 875 88 0 

Gin Ser Thr Thr Phe His Ser Ser Pro Gly Asp Thr Glu Thr Thr Leu 
885 89° 895 

Leu Pro Asp Asp Thr He Thr Ser Gly Leu Val Glu Ala Ser Thr Pro 
900 905 910 

Thr His Ser Ser Thr Gly Ser Leu His Thr Thr Leu Thr Pro Ala Ser 
915 920 925 

Ser Thr Ser Ala Gly Leu Gin Glu Glu Ser Thr Thr Phe Gin Ser Trp 
930 935 940 

Pro Ser Ser Ser Asp Thr Thr Pro Ser Pro Pro Gly Pro 
945 950 955 



<210> 4 
<211> 28 
<212> PRT 

<213> Homo sapiens 



WO 00/04142 PCT/AU99/00579 



<220> 

<221> REPEAT 
<222> (1) . . (28) 

<223> MUC12 consensus tandem repeat sequence 
<400> 4 

Ser Gly Leu Ser Gin Glu Ser Thr Thr Phe His Ser Ser Pro Gly Ser 

Thr Glu Thr Thr Leu Ser Pro Ala Ser Thr Thr Thr 
20 25 



<210> 5 

<211> 2095 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (3) . . (1757) 

<400> 5 

aa aca etc tea cct gec age atg aga age tec age ate agt gga gaa 

Thr Leu Ser Pro Ala Ser Met Arg Ser Ser Ser lie Ser Gly Glu 

15 10 15 

ccc ace age ttg tat age eaa gca gag tea aca cae aca aca gcg ttc 
Pro Thr Ser Leu Tyr Ser Gin Ala Glu Ser Thr His Thr Thr Ala Phe 
20 25 30 

cct gee age ace ace ace tea ggc etc agt cag gaa tea aca act ttc 
Pro Ala Ser Thr Thr Thr Ser Gly Leu Ser Gin Glu Ser Thr Thr Phe 
35 40 45 

cae agt aag cca ggc tea act gag aca aca ctg tec cct ggc age ate 
His Ser Lys Pro Gly Ser Thr Glu Thr Thr Leu Ser Pro Gly Ser He 
50 55 60 

aca act tea tct ttt get caa gaa ttt ace acc cct cat age caa cca 
Thr Thr Ser Ser Phe Ala Gin Glu Phe Thr Thr Pro His Ser Gin Pro 
65 ™ 75 

ggc tea get ctg tea aca gtg tea cct gee age acc aca gtg cca ggc 
Gly Ser Ala Leu Ser Thr Val Ser Pro Ala Ser Thr Thr Val Pro Gly 
80 85 90 95 



47 



95 



143 



191 



239 



287 
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Leu Ser Glu Glu Ser Thr Thr Phe Tyr Ser Ser Pro Gly Ser Thr Glu 
100 105 110 

acc aca gcg ttt tct cac age aac aca atg tec att cat agt caa caa 
Thr Thr Ala Phe Ser His Ser Asn Thr Met Ser He His Ser Gin Gin 
115 120 125 

tct aca ccc ttc cct gac age cca ggc ttc act cac aca gtg tta cct 
Ser Thr Pro Phe Pro Asp Ser Pro Gly Phe Thr His Thr Val Leu Pro 
130 135 140 

gec acc etc aca acc aca gac att ggt cag gaa tea aca gee ttc cac 
Ala Thr Leu Thr Thr Thr Asp He Gly Gin Glu Ser Thr Ala Phe His 
145 150 155 

age age tea gac gca act gga aca aca ccc tta cct gee cgc tec aca 
Ser Ser Ser Asp Ala Thr Gly Thr Thr Pro Leu Pro Ala Arg Ser Thr 
160 165 170 175 

gec tea gac ctt gtt gga gaa cct aca act ttc tac ate age cca tec 575 
Ala Ser Asp Leu Val Gly Glu Pro Thr Thr Phe Tyr He Ser Pro Ser 
180 185 19° 



cct act tac aca aca etc ttt cct gcg agt tec age aca tea ggc etc 
Pro Thr Tyr Thr Thr Leu Phe Pro Ala ser Ser Ser Thr Ser Gly Leu 
195 200 205 

act gag gaa tct acc acc ttc cac acc agt cca age ttc act tct aca 
Thr Glu Glu Ser Thr Thr Phe His Thr Ser Pro Ser Phe Thr Ser Thr 
210 215 220 

att gtg tct act gaa age ctg gaa acc tta gca cca ggg ttg tgc cag 
lie Val Ser Thr Glu Ser Leu Glu Thr Leu Ala Pro Gly Leu Cys Gin 
225 230 235 

gaa gga caa att tgg aat gga aaa caa tgc gtc tgt ccc caa ggc tac 
Glu Gly Gin He Trp Asn Gly Lys Gin Cys Val Cys Pro Gin Gly Tyr 
240 245 250 255 

gtt ggt tac cag tgc ttg tec cct ctg gaa tec ttc cct gta gaa ace 
Val Gly Tyr Gin Cys Leu Ser Pro Leu Glu Ser Phe Pro Val Glu Thr 
260 265 2-70 

ccg gaa aaa etc aac gec act tta ggt atg aca gtg aaa gtg act tac 
Pro Glu Lys Leu Asn Ala Thr Leu Gly Met Thr Val Lys Val Thr Tyr 
275 280 285 



383 



431 



479 



527 



623 



671 



719 



767 



815 



863 
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Arg Asn Phe Thr Glu Lys Met Asn Asp Ala Ser Ser Gin Glu Tyr Gin 
290 295 300 



aac ttc agt acc etc ttc aag aat egg atg gat gtc gtt ttg aag ggc 
Asn Phe Ser Thr Leu Phe Lys Asn Arg Met Asp Val Val Leu Lys Gly 
305 310 315 



959 



gac aat ctt cct cag tat aga ggg gtg aac att egg aga ttg etc aac 
Asp Asn Leu Pro Gin Tyr Arg Gly Val Asn lie Arg Arg Leu Leu Asn 
320 325 330 335 

ggt age ate gtg gtc aag aac gat gtc ate ctg gag gca gac tac act 
Gly Ser lie Val Val Lys Asn Asp Val lie Leu Glu Ala Asp Tyr Thr 
340 345 350 



1007 



1055 



tta gag tat gag gaa ctg ttt gaa aac ctg gca gag att gta aag gec 
Leu Glu Tyr Glu Glu Leu Phe Glu Asn Leu Ala Glu He Val Lys Ala 

360 365 



355 

aag att atg aat gaa act aga aca act ctt ett gat cct gat tec tgc 
Lys He Met Asn Glu Thr Arg Thr Thr Leu Leu Asp Pro Asp Ser Cys 
370 375 380 

aga aag gee ata ctg tgc tat agt gaa gag gac act ttc gtg gat tea 
Arg Lys Ala He Leu Cys Tyr Ser Glu Glu Asp Thr Phe Val Asp Ser 
385 390 395 

teg gtg act ccg ggc ttt gac ttc cag gag caa tgc acc cag aag get 
Ser Val Thr Pro Gly Phe Asp Phe Gin Glu Gin cys Thr Gin Lys Ala 
400 405 410 415 

gec gaa gga tat ace cag ttc tac tat gtg gat gtc ttg gat ggg aag 
Ala Glu Gly Tyr Thr Gin Phe Tyr Tyr Val Asp Val Leu Asp Gly Lys 
420 425 430 

ctg gee tgt gtg aac aag tgc acc aaa gga acg aag teg caa atg aac 
Leu Ala Cys Val Asn Lys Cys Thr Lys Gly Thr Lys Ser Gin Met Asn 



435 



440 



445 



tgt aac ctg ggc aca tgt cag ctg caa cgc agt ggc ccc cgc tgc ctg 
Cys Asn Leu Gly Thr Cys Gin Leu Gin Arg Ser Gly Pro Arg Cys Leu 
450 455 



460 



tgc cca aat acg aac aca cac tgg tac tgg gga gag aec tgt gaa ttc 
cys Pro Asn Thr Asn Thr His Trp Tyr Trp Gly Glu Thr Cys Glu Phe 

470 475 



1103 



1151 



1199 



1247 



1295 



1343 



1391 



1439 



465 



-i a o n 
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Asn lie Ala Lys Ser Leu Val Tyr Gly lie Val Gly Ala Val Met Ala 
480 485 490 495 

gtg ctg ctg etc gca ttg ate ate eta ate ate tta ttc age eta tec 1535 
Val Leu Leu Leu Ala Leu He He Leu He He Leu Phe Ser Leu Ser 
500 505 510 

cag aga aaa egg cac agg gaa cag tat gat gtg cct caa gag tgg cga 1583 
Gin Arg Lys Arg His Arg Glu Gin Tyr Asp Val Pro Gin Glu Trp Arg 
515 520 525 

aag gaa ggc acc cct ggc ate ttc cag aag acg gee ate tgg gaa gac 1631 
Lys Glu Gly Thr Pro Gly He Phe Gin Lys Thr Ala He Trp Glu Asp 
530 535 540 

cag aat ctg agg gag age aga ttc ggc ctt gag aac gee tac aac aac 1679 
Gin Asn Leu Arg Glu Ser Arg Phe Gly Leu Glu Asn Ala Tyr Asn Asn 
545 550 555 

ttc egg cee acc ctg gag act gtt gac tet ggc aca gag etc cac ate 1727 
Phe Arg Pro Thr Leu Glu Thr Val Asp Ser Gly Thr Glu Leu His He 
560 565 570 575 

cag agg ccg gag atg gta gca tee act gtg tgagccaaeg ggggectcee 1777 
Gin Arg Pro Glu Met Val Ala Ser Thr Val 
580 585 

acecteatct agctctgttc aggagagctg caaaeacaga gcecaccaca agcetceggg 1837 
gegggtcaag aggagaccga agtcaggccc tgaagecggt cetgetctga gctgacagae 1897 
ttggecagtc ccetgcctgt getectgetg gggaaggctg ggggctgtaa gectetecat 1957 
cegggagctt ecagactcce agaagectcg geacecctgt ctcctcctgg gtggcteccc 2017 
actctggaat ttccetacca ataaaagcaa atctgaaagc tcaaaaaaaa aaaaaaaaaa 2077 

2095 

aaaaaaaaaa aaaaaaaa 



<210> 6 
<211> 585 
<212> PRT 

<213> Homo sapiens 

Thr°Le! ser Pro Ala Ser Met Arg Ser Ser Ser He Ser Gly Glu Pro 

- _ 1 C 
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Thr Ser Leu Tyr Ser Gin Ala Glu Ser Thr His Thr Thr Ala Phe Pro 
20 25 30 

Ala ser Thr Thr Thr Ser Gly Leu Ser Gin Glu Ser Thr Thr Phe His 
35 40 45 

Ser Lys Pro Gly Ser Thr Glu Thr Thr Leu Ser Pro Gly Ser He Thr 
50 55 6° 

Thr Ser Ser Phe Ala Gin Glu Phe Thr Thr Pro His Ser Gin Pro Gly 
65 70 75 80 

Ser Ala Leu Ser Thr Val Ser Pro Ala Ser Thr Thr Val Pro Gly Leu 
85 90 95 

Ser Glu Glu Ser Thr Thr Phe Tyr Ser Ser Pro Gly Ser Thr Glu Thr 
100 105 no 

Thr Ala Phe Ser His Ser Asn Thr Met Ser lie His Ser Gin Gin Ser 
115 120 I 25 

Thr Pro Phe Pro Asp Ser Pro Gly Phe Thr His Thr Val Leu Pro Ala 
130 135 I 40 

Thr Leu Thr Thr Thr Asp He Gly Gin Glu Ser Thr Ala Phe His Ser 
145 150 155 I 60 



Ser Ser Asp Ala Thr Gly Thr Thr Pro Leu 



Pro Ala Arg Ser Thr Ala 



165 170 

Ser Asp Leu Val Gly Glu Pro Thr Thr Phe Tyr He Ser Pro Ser Pro 
180 185 190 

Thr Tyr Thr Thr Leu Phe Pro Ala Ser Ser Ser Thr Ser Gly Leu Thr 
195 200 205 

Glu Glu Ser Thr Thr Phe His Thr Ser Pro Ser Phe Thr Ser Thr He 
210 215 220 

val Ser Thr Glu Ser Leu Glu Thr Leu Ala Pro Gly Leu Cys Gin Glu 
225 230 235 240 



Gly Gin He Tr P Asn Gly Lys Gin Cys Val Cys Pro Gin Gly Tyr Val 
245 



250 255 



Gly Tyr Gin Cys Leu Ser Pro Leu Glu Ser 



Phe Pro Val Glu Thr Pro 
Tin 
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Glu Lys Leu Asn Ala Thr Leu Gly Met Thr Val Lys Val Thr Tyr Arg 
275 280 285 

Asn Phe Thr Glu Lys Met Asn Asp Ala Ser Ser Gin Glu Tyr Gin Asn 
290 295 300 

Phe Ser Thr Leu Phe Lys Asn Arg Met Asp Val Val Leu Lys Gly Asp 
305 310 315 320 

Asn Leu Pro Gin Tyr Arg Gly Val Asn He Arg Arg Leu Leu Asn Gly 
325 330 335 

Ser He Val Val Lys Asn Asp Val He Leu Glu Ala Asp Tyr Thr Leu 
340 345 350 

Glu Tyr Glu Glu Leu Phe Glu Asn Leu Ala Glu He Val Lys Ala Lys 
355 360 365 

He Met Asn Glu Thr Arg Thr Thr Leu Leu Asp Pro Asp Ser Cys Arg 
370 375 380 

Lys Ala He Leu Cys Tyr Ser Glu Glu Asp Thr Phe Val Asp Ser Ser 
385 390 395 400 

Val Thr Pro Gly Phe Asp Phe Gin Glu Gin Cys Thr Gin Lys Ala Ala 
405 410 415 

Glu Gly Tyr Thr Gin Phe Tyr Tyr Val Asp Val Leu Asp Gly Lys Leu 
420 425 430 

Ala Cys Val Asn Lys Cys Thr Lys Gly Thr Lys Ser Gin Met Asn cys 



435 



440 



445 



As 



n Leu Gly Thr Cys Gin Leu Gin Arg Ser Gly Pro Arg Cys Leu Cys 



450 



455 460 



Pro Asn Thr Asn Thr His Tr P Tyr Trp Gly Glu Thr Cys Glu Phe Asn 

Am 475 480 

465 470 HID 



He Ala Lys Ser Leu Val Tyr Gly He Val Gly Ala Val Met Ala Val 
485 



490 495 



Leu Leu Leu Ala Leu He He Leu lie He Leu Phe Ser Leu Ser Gin 
500 505 510 

Arg Lys Arg His Arg Glu Gin Tyr Asp Val Pro Gin Glu Trp Arg Lys 



WO 00/04142 PCT/AU99/00579 



Glu Gly Thr Pro Gly He Phe Gin Lys Thr Ala He Trp Glu Asp Gin 
530 535 540 

Asn Leu Arg Glu Ser Arg Phe Gly Leu Glu Asn Ala Tyr Asn Asn Phe 

555 560 



545 



550 



Arg Pro Thr Leu Glu Thr Val Asp Ser Gly Thr Glu Leu His He Gin 
565 570 575 



Arg Pro Glu Met Val Ala Ser Thr Val 
580 585 



<210> 7 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Random 1 
PCR primer 

<400> 7 
acttcgccac 



<210> 8 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :MUC 
forward primer 

<400> 8 

tgaagggcga caatcttcct c 



<210> 9 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : MUC12 r 
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<400> 9 

tacacgaggc tcttggcgat gttg 

<210> 10 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
PCR primer 

<400> 10 

caggcgtcag tcaggaatct acag 

<210> 11 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



gaggctgtgg tgttgtcagg taag 



<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Beta2-microglobulin forward PCR primer 

<400> 12 

tgaattgcta tgtgtctggg t 



<220> 

<223> Description 
PCR primer 



of Artificial Sequence: Mucll reverse 



<400> 11 



24 



<210> 13 
<211> 21 
<212> DNA 
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<220> 

<223> Description of Artificial Sequence: Beta 
2-microglobulin reverse PCR primer 

<400> 13 

cctccatgat gctgcttaca t 



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MUC12 forward 
primer for verification of contiguous sequence 

<400> 14 

20 

agccaaccag gctcagctct 



<210> 15 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MUC12 reverse 
primer for verification of contiguous sequence 

<400> 15 

21 

gctcacacag tggatgctac c 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Cl 
reverse PCR primer 



<400> 16 

gggaacactg tggtttcagt tgag 



24 
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<210> 17 
<211> 17 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> PEPTIDE 
<222> (1) . . (17) 

<223> MUCH immunizing peptide 



<400> 17 

Cys Phe His Ser Arg Pro Ala Ser Thr His Thr Thr Leu Phe Thr Glu 
15 10 15 



Asp 



<210> 18 
<211> 17 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> PEPTIDE 
<222> (1) . . (17) 

<223> MUC12 immunizing peptide 



<400> 18 

Thr Tyr Arg Asn Phe Thr Glu Lys Met Asn Asp Ala Ser Ser Gin Glu 

* 10 I 5 



Cys 
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